TITLE OF THE INVENTION 
Preparation of Defined Highly Labeled Probes 



CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims priority from Provisional Application Serial Nos. 60/427,232. 
60/427,233, and 60/427,234. each filed on November 19, 2002. and each of which is 
Incorporated herein by reference in its entirety. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 
Not Applicable. 

REFERENCE TO A SEQUENCE LISTING 
The Sequence Listing, which is a part of the present disclosure, includes a text file 
comprising nucleotide and/or amino acid sequences of the present invention on a floppy disk. 
The subject matter of the Sequence Listing is incorporated herein by reference in its entirety. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to methods for preparing probes for identifying molecules, 
molecular interactions and molecular complexes by direct detection of interaction with one or 
more probes. 

2. Description of the Related Art 

Techniques for detecting DNA sequences 

A variety of DNA hybridization techniques are available for detecting the presence of one 
or more selected polynucleotide sequences in a sample containing a large number of sequence 
regions. In a simple method, which relies on fragment capture and labeling, a fragment 
containing a selected sequence is captured by hybridization to an immobilized probe. The 
captured fragment can be labeled by hybridization to a second probe which contains a 
detectable reporter moiety. 

Another widely used method is Southern blotting. In this method, a mixture of DNA 
fragments in a sample are fractionated by gel electrophoresis, then fixed on a nitrocellulose 
filter. By reacting the filter with one or more labeled probes under hybridization conditions, the 
presence of bands containing the probe sequence can be identified. The method is especially 
useful for identifying fragments in a restriction-enzyme DNA digest which contain a given probe 
sequence, and for analyzing restriction-fragment length polymorphisms (RFLPs). 

Another approach to detecting the presence of a given sequence or sequences in a 
polynucleotide sample involves selective amplification of the sequence(s) by polymerase chain 
reaction. In this method, polymers complementary to opposite end portions of the selected 
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sequence(s) are used to promote, in conjunction with thermal cycling, successive rounds of 
palmer-initiated replication. The amplified sequence may be readily identified by a variety of 
techniques. This approach is particularly useful for detecting the presence of low-copy 
sequences in a polynucleotide-containing sample, e.g., for detecting pathogen sequences in a 
body-fluid sample. 

More recently, methods of identifying known target sequences by probe ligation methods 
have been reported. In one approach, known as oligonucleotide ligation assay (OLA), two 
probes or probe elements which span a target region of interest are hybridized with the target 
region. Where the probe elements match (basepair with) adjacent target bases at the 
confronting ends of the probe elements, the two elements can be joined by ligation, e.g., by 
treatment with ligase. The ligated probe element is then assayed, evidencing the presence of 
the target sequence. 

In a modification of this approach, the ligated probe elements act as a template for a pair 
of complementary probe elements. With continued cycles of denaturation, reannealing and 
ligation in the presence of the two complementary pairs of probe elements, the target sequence 
is amplified geometrically, allowing very small amounts of target sequence to be detected and/or 
amplified. This approach is also referred to as Ligase Chain Reaction (LCR). 

There is a growing need, e.g., in the field of genetic screening, for methods useful in 
detecting the presence or absence of each of a large number of sequences in a target 
polynucleotide. For example, as many as 150 different mutations have been associated with 
cystic fibrosis. In screening for genetic predisposition to this disease, it is optimal to test all of the 
possible different gene sequence mutations in the subject's genomic DNA, in order to make a 
positive identification of a "cystic fibrosis". Ideally, one would like to test for the presence or 
absence of all of the possible mutation sites in a single assay. 

These prior-art methods described above are not readily adaptable for use in detecting 
multiple selected sequences in a convenient, automated single-assay format. It is therefore 
desirable to provide a rapid, single-assay format for detecting the presence or absence of 
multiple selected sequences in a polynucleotide sample. 

Primer Extension 

One of the most powerful and versatile tools available to molecular biologists is the in 
vitro replication of nucleic acid sequences by primer extension, as exemplified by the ubiquitous 
techniques of polymerase chain reaction (PGR) and DNA sequencing. Both techniques include 
the steps of: 1) hybridizing a short, e.g. 15-30 nt, synthetic oligonucleotide primer to a single- 
stranded template nucleic acid; and 2) enzymatically extending from the 3' hydroxyl terminus of 
the primer in the presence of nucleotide 5'-triphosphates, complementary to the template strand, 
and a polymerizing enzyme. By this general primer extension method, sequencing information is 
generated, template nucleic acids are amplified or copied, and other genetic analysis tests are 
conducted. Results are optimized through the choice and concentrations of primers, multiple 
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primers, enzymes, nucleotides, and other reagents, and the selection of temperature, 
temperature cycling conditions, and other experimental conditions. 

The choice of primers has been primarily limited to 2 -deoxyoligonucleotide primers 
made by the phosphoramidite chemistry method on automated synthesizers. Whereas nucleic 
acid analogs are known which efficiently hybridize to DNA or RNA, some with comparable or 
superior hybridization specificity and/or affinity, enzyme-mediated formation of a new 
phosphodiester bond only occurs between a primer having a 3' terminal hydroxyl and a 
nucleotide having a 5 -triphosphate, or a closely related isostere, i.e. .alpha.-thiotriphosphate, 
etc. Most structural permutations in either the primer or the nucleotide severely compromise the 
efficiency of primer extension, or negate it totally. 

Nucleic acid analogs are structural analogs of DNA and RNA and which are designed to 
hybridize to complementary nucleic acid sequences. Through modification of the internucleotide 
linkage, the sugar, and/or the nucleobase, nucleic acid analogs may attain any or all of the 
following desired properties: 1) optimized hybridization specificity or affinity, 2) nuclease 
resistance, 3) chemical stability, 4) solubility, 5) membrane-permeability, and 6) ease or low 
costs of synthesis and purification. 

Peptide nucleic acids (PNA) 

A useful and accessible class of nucleic acid analogs is the family of peptide nucleic 
acids (PNA) in which the sugar/phosphate backbone of DNA or RNA has been replaced with 
acyclic, achiral, and neutral polyamide linkages. The 2-aminoethylglycine polyamide linkage in 
particular has been well-studied and shown to impart exceptional hybridization specificity and 
affinity when nucleobases are attached to the linkage through an amide bond. 

2-Aminoethylglycine PNA oligomers typically have greater affinity, i.e. hybridization 
strength and duplex stability for their complementary PNA, DNA and RNA, as exemplified by 
higher thermal melting values (Tm), than the corresponding DNA sequences. The melting 
temperatures of PNA/DNA and PNA/RNA hybrids are much higher than corresponding 
DNA/DNA or DNA/RNA duplexes (generally 1. degree. C. per bp) due to a lack of electrostatic 
repulsion in the PNA-containing duplexes. Also, unlike DNA/DNA duplexes, the Tm of PNA/DNA 
duplexes are largely independent of salt concentration. The 2-aminoethylglycine PNA oligomers 
also demonstrate a high degree of base-discrimination (specificity) in pairing with their 
complementary strand. Specificity of hybridization can be measured by comparing Tm values of 
duplexes having perfect Watson/Crick complementarity and those with one or more 
mismatches. The degree of destabilization of mismatches, measured by the decrease in Tm 
(.DELTA.Tm), is a measure of specificity. In addition to mismatches, specificity and affinity are 
affected by structural modifications, hybridization conditions, and other experimental 
parameters. The neutral backbone of PNA also increases the rate of hybridization significantly in 
assays where either the target, template, or the PNA probe is immobilized on a solid substrate. 
Without any electrostatic repulsion, the rate of hybridization is often much higher for PNA probes 
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than for DNA or RNA probes in applications such as Southern blotting, northern blots, or in situ 
hybridization experinnents. Unlike DNA, PNA can displace one strand, "strand invasion", of a 
DNA/DNA duplex. With certain DNA sequences, a second PNA can further bind to form an 
unusually stable triple helix structure (PNA).sub.2 /DNA. PNA have been investigated as 
potential antisense agents, based on their sequence-specific inhibition of transcription and 
translation. PNA oligomers themselves are not substrates for polymerase as primers or 
templates, and do not conduct primer extension with nucleotides. 

Linked Nucleic Acids (LNA) 

Locked Nucleic Acids (LNA) monomers are bi-cyclic compounds structurally very similar 
to RNA-monomers, LNA share most of the chemical properties of DNA and RNA. However, 
introduction of LNA monomers into either DNA or RNA oligos results in unprecedented high 
thermal stability of duplexes with complementary DNA or RNA, while, at the same time obeying 
the Watson-Crick base-pairing rules. 

Detection of DNA mutations 

Detection of specific genetic sequences is an area of active research and development. 
However, many problems still exist, such as low levels of signal, small sample size, high sample 
complexity, and the like. Improvements in the ability to provide a multiplicity of labels to a 
specific probe sequence are of interest, particularly using reagents that are compatible with 
standard phosphoramidite synthesis. The present invention addresses these issues. 

The detection of mutations in sequences of DNA is becoming increasingly important in 
medical science. The detection of such a mutation in a DNA sequence typically involves the use 
of an oligodeoxyribonucleotide probe that is complementary to the target DNA sequence. The 
probe is designed to present some moiety, such as a radioactive element, that signals the 
occurrence of hybridization in a filter assay or an electrophoretic gel. The identification of 
hybridization has been used diagnostically for specific bacterial infections by detection of 
Mycobacterium tuberculosis genomic DNA, gonorrhea rRNA, Chlamydia genomic and plasmid 
DNA and Escherichia coli and Bacillus subtilis rRNA. Hybridization assays have also been 
developed for viral detection, including cytomegalovirus (CMV), human papilloma virus (HPV), 
and HIV-1. 

By combining target amplification with allele specific oligonucleotides, small samples of 
human DNA can be analyzed for purposes of genetic screening, including the study of genetic 
changes associated with well-known inherited diseases. For instance, cancers typically display 
familial site-specific clustering. The identification of this kind of clustering can aid in the 
determination of enhanced risk for the development of the particular cancer. In addition, 
hereditary metabolic variations in DNA have been identified that affect the metabolism of known 
carcinogens. A variation that would increase the metabolism of a carcinogen may impact the 
likelihood of the development of cancer and, if developed, the speed of the cancer's growth. 
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Hybridization Probes 

Traditional hybridization methods have been developed which employ radioactive probes 
with separation on filters. While radioactive probes have performed suitably well, growing 
concern over the use of radioactive materials has stimulated a search for alternative probes that 
achieve similar levels of sensitivity and performance without the risks and dangers associated 
with radioactive materials. For instance, biotin has been incorporated into an 
oligodeoxyribonucleotide for use in biotinavidin-linked analyses. In addition, numerous 
modifications of DNA have been used in the development of other alternative probes, including 
links to antibodies, gold-antibodies, mercury for double antibody reactions, eupsoralen, and 
fluorescent dye links for fluorescence detection of hybridization. These alternative methods 
typically allow approximately 10.sup.5 to 10. sup. 6 copies of the DNA to be detected. 

These and related advancements in the art have given rise to several methods of DNA 
mutation detection. These methods include denaturing gradient gel electrophoresis (DGGE), 
single-strand conformational polymorphisms (SSCP), temperature gradient gel electrophoresis 
(TGGE), the heteroduplex method (HET), ribonuclease cleavage, chemical cleavage of 
mismatch (CCU), ligase assay, allele-specific amplification (ASA) dideoxy fingerprinting (ddF), 
and allele-specific oligonucleotides (ASO). DGGE, SSCP, TGGE, HET, and ddF are frequently 
used to locate which exons of a gene contain mutations. 

The currently available non-radioactive methods for detecting mutations in DNA have 
been somewhat problematic. For example, these methods have been generally unable to 
consistently provide accurate results in detecting point mutations in DNA. These detection 
methods have also proven to be time-consuming and quite costly to use. In addition, these non- 
radioactive mechanisms require a significant amount of DNA to perform their detecting function, 
though many times only a small quantity of DNA is available for analysis. Moreover, these 
methods are difficult to use, often requiring complex instruments and highly trained technicians 
not available in many laboratories. Finally, the materials utilized in these methods are generally 
either fragile or prone to degradation during the testing procedure. 

Many detection techniques for nucleic acids utilize short, labeled oligonucleotide probes, 
containing a small number of labels. These have the advantage of being well defined, they have 
limited signal intensity due to the limited number of labels present in the probe. Several 
methods have been described to increase the length and number of labels within probes and 
correspondingly increase the probe intensity. 

Beike et al. (WO 00/462342, PCT/USOO/02897) describe an invention which provides 
novel nucleic acid labeling techniques that generate nucleic acid probes with specific activities at 
least ten fold higher than the levels obtained using standard labeling methods. Specifically, the 
methods of the invention provide methods of producing nucleic probes that each comprises 
multiple labeled nucleotides. The methods can be used to generate RNA, DNA or hybrid probes. 
The invention also provides reaction mixtures and kits for the practice of the methods of the 
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invention and compositions comprising the probes generated according to the methods of the 
invention. The probes described in this method can be either double stranded or single 
stranded and contain a signal domain ranging from 5 to 100 nucleotides in which the label is 
incorporated. 

In another example the probes are primarily double stranded. The application of Shafer 
(WO 00/04192, PCT/US99/16242) describe gene probes including a number of related designs 
for gene probe components, multilinking components and signaling components, all of which are 
modular in nature and can be used together or in part. These components are generally joined 
together in composite structures by hybridization of complementary sub-segments, called 
linkers. The reporters of the present invention are also designed to be conjoined into arrays that 
can provide amplified signaling. The multilinking components of the present invention may be 
interposed between the probe and the reporter units and provide for the binding of multiple 
reporters. These probe and signaling methods also include means to achieve mixed-color 
labeling that is specific to each target. The probes are useful for detecting target sequences in a 
wide variety of formats including, but not limited to, membrane formats, in situ formats, and on 
various solid substrate chip formats. However, these methods generally result in either 
relatively short labeled probes or multimeric probes of varied size and intensity. 

Several methods have been described for generating highly labeled primers using 
synthetic branched structures including Urdea et al. (US 5,681.697), Mandrand et al. (US 
5,695.936) and Gryaznov (US 5,571 ,677) which require complex synthesis methods. In addition 
the resulting probes are likely to have varied size and intensity. 

Using the techniques illustrated above and other described methods, probes have been 
constructed for the detection of nucleic acids, and applied in a variety of different and 
increasingly more sensitive detection methods for individual or multiple nucleic acid targets. 
However, a need exists for materials and methods which will enable analysis of nucleic acids 
(and especially multiple nucleic acid targets) at extremely low levels. Two means that can help 
accomplish this are solution hybridization, which maximizes hybridization effectiveness and 
single molecule detection for maximal sensitivity. This invention describes the production and 
use of highly labeled probes of unitized size, high intensity, charge, and mass for use in 
hybridization, but especially for use in single molecule detection and molecular electrophoresis! 

BRIEF SUMMARY OF THE INVENTION 
Accordingly, it is an object of the invention to overcome problems with the detection of 
nucleic acids in complex mixtures and associated with the related art. These and other objects, 
features and technical advantages are achieved by providing highly labeled probes of exact size 
for use in single-molecule electrophoresis and methods of producing the same. 

This invention provides a method for producing a single-stranded unitized nucleic acid 
probe comprising the acts of: (a) contacting an oligonucleotide primer having a 5' recognition 
end having a length of between about 6 to 50 nucleotides and having a 3* priming end having a 
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length of between about 6 to 50 nucleotides with a fixed-size template having a length between 
101 and about 10,000 nucleotides under reaction conditions conducive to transcribing a unitized 
transcript from the fixed-size template; and (b) labeling the unitized transcript with at least one 
detectable molecule, thereby producing a unitized nucleic acid probe. 

In accordance with a further aspect of the invention, the fixed-size template is between 
1,000 and 9,000 nucleotides in length. In a preferred embodiment, the fixed-size template is 
between 2,000 and 5,000 nucleotides in length, more preferably the fixed-size template is 
between 2,500 and 2,700 nucleotides in length. 

In accordance with a further aspect of the invention the fixed size template is prepared 
by a method selected from the group consisting of restriction digestion of a polynucleotide and 
generation of a polynucleotide by polymerase chain reaction. 

In accordance with a further aspect of the invention the acts (a) and (b) are peri'ormed 
simultaneously. In accordance with yet another aspect of the invention, where act (a) is 
performed before act (b). 

In accordance with a further aspect of the invention, substantially all nucleotides 
comprising the unitized transcript are labeled. In accordance with yet another aspect of the 
invention, the detectable molecule is selected from the group consisting of a dye tag, mass tag, 
radioactive tag, and any combination thereof. In accordance with yet another aspect of the 
invention, the detectable molecule is a dye tag. In accordance with yet another aspect of the 
invention, the unitized transcript is labeled with two or more different dye tags. 

In accordance with a further aspect of the invention, the detectable molecule is a mass 
tag. In accordance with yet another aspect of the invention, the unitized transcript is labeled 
with two or more different mass tags. 

In accordance with a further aspect of the invention, the 5* recognition end is in contact 
with a nucleotide construct selected from the group consisting of LNA, PNA, XLNT, and any 
combination thereof. 

A second aspect of this invention provides a method for method for producing a unitized 
nucleic acid probe comprising the acts of: (a) synthesizing an oligonucleotide primer having a 5* 
recognition end having a length of between about 6 to 50 nucleotides and having a 3* priming 
end having a length of between about 6 to 50 nucleotides; (b) preparing a fixed-size template 
having a length between 101 and about 10,000 nucleotides; (c) contacting the oligonucleotide 
primer and the fixed-size template under reaction conditions conducive to transcribing a unitized 
transcript from the fixed-size template; and (d) labeling the unitized transcript with at least one 
detectable molecule, thereby producing a unitized nucleic acid probe. 

A third aspect of this invention provides a method for producing a single-stranded 
unitized nucleic acid probe comprising the acts of: (a) contacting: a first oligonucleotide primer 
having a 5* recognition end having a length of between about 6 to 50 nucleotides and having a 3' 
priming end having a length of between about 6 to 50 nucleotides; a fixed-size template having 
a length between 101 and about 10,000 nucleotides; a blocking PNA that is complementary to 
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the blocker; and a second oligonucleotide primer complementary to the template under reaction 
conditions conducive to transcribing a unitized transcript from the fixed-size template; and (b) 
labeling the unitized transcript with at least one detectable molecule, thereby producing a 
unitized nucleic acid probe. 

In accordance with a further aspect of the invention, the fixed-size template is between 
1 ,000 and 9,000 nucleotides in length. In a preferred embodiment, the fixed-size template is 
between 2,000 and 5,000 nucleotides in length, more preferably, the fixed-size template is 
between 2,500 and 2,700 nucleotides in length. 

In accordance with a further aspect of the invention, the fixed size template is prepared 
by a method selected from the group consisting of restriction digestion of a polynucleotide and 
generation of a polynucleotide by polymerase chain reaction. 

In accordance with a further aspect of the invention, acts (a) and (b) are performed 
simultaneously. In accordance with yet another aspect of the invention, act (a) is performed 
before act (b). 

In accordance with a further aspect of the invention, substantially all nucleotides 
comprising the unitized transcript are labeled. 

In accordance with a further aspect of the invention, the detectable molecule is selected 
from the group consisting of a dye tag, mass tag, radioactive tag, and any combination thereof. 
In accordance with yet another aspect of the invention, the detectable molecule is a dye tag. In 
accordance with yet another aspect of the invention, the unitized transcript is labeled with two or 
more different dye tags. In accordance with yet another aspect of the invention, the detectable 
molecule is a mass tag. In accordance with yet another aspect of the invention, the unitized 
transcript is labeled with two or more different mass tags. 

In accordance with a further aspect of the invention, the 5' recognition end is in contact 
with a nucleotide construct selected from the group consisting of LNA. PNA, XLNT, and any 
combination thereof. In accordance with yet another aspect of the invention, the XLNT is cross- 
linked to the blocker nucleotide construct. 

A fourth aspect of this invention provides a method for producing substantially double 
unitized nucleic acid probe comprising the acts of: (a) synthesizing a first oligonucleotide primer 
having a 5* recognition end having a length of between about 6 to 50 nucleotides and having a 3* 
priming end having a length of between about 6 to 50 nucleotides, wherein a portion of said 
priming end is complementary a portion of a fixed-size template having a length between 101 
and about 10,000 nucleotides, and wherein a blocker nucleotide construct is in contact with both 
the recognition and the priming sequences, provided that the blocker nucleotide construct is not 
in contact with the complementary portion or the fixed-size template; (b) synthesizing a blocking 
PNA that is complementary to the blocker nucleotide construct; (c) hybridizing the blocking PNA 
to the blocker nucleotide construct; (d) synthesizing a second oligonucleotide primer 
complementary to the template; (e) contacting the primers and blocking nucleotide construct of 
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(a) through (d) under reaction conditions conducive to transcribing a unitized transcript from the 
fixed-size template; and (f) labeling the unitized transcript with at least one detectable molecule, 
thereby producing a unitized nucleic acid probe. 

A fifth aspect of this invention provides a method for producing a substantially double 
stranded unitized nucleic acid probe comprising the acts of: (a) contacting: a probe 
oligonucleotide primer having a 5' recognition end having a length of about 6 to 50 nucleotides 
and having a 3* linking end having a length of about 6 to 50 nucleotides, wherein the 3" linking 
end is complementary to the 3' end of a stitching oligonucleotide having a length of about 6 to 50 
nucleotides a stitching oligonucleotide having a 3' end complementary to the 3' end of the linking 
end and having a 5* end complementary to the 5' end of a label nucleic acid; a double-stranded 
label nucleic acid having a length of about 25,000 to 50,000 nucleotides and having a 5' 
extension; and DNA ligase under reaction conditions conducive to DNA ligation; and (b) labeling 
the unitized transcript with at least one detectable molecule, thereby producing a unitized 
nucleic acid probe. 

In accordance with a further aspect of the invention, acts (a) and (b) are performed 
simultaneously. In accordance with yet another aspect of the invention, act (a) is perfomied 
before act (b). 

In accordance with a further aspect of the invention, the detectable molecule is selected 
from the group consisting of a dye tag, mass tag, radioactive tag, and any combination thereof. 
In accordance with yet another aspect of the invention, the detectable molecule is a dye tag. In 
accordance with yet another aspect of the invention, the unitized transcript is labeled with two or 
more different dye tags. 

In accordance with a further aspect of the invention, the detectable molecule is a mass 
tag. A In accordance with yet another aspect of the invention, the unitized transcript is labeled 
with two or more different mass tags. 

In accordance with a further aspect of the invention, both ends of the label nucleic acid 
fragment have restriction fragment ends complementary to the stitching probe ends. In 
accordance with yet another aspect of the invention, only one end of the label fragment has a 
restriction fragment end complementary to a stitching probe end. 

In accordance with a further aspect of the invention, the recognition end of the probe 
oligonucleotide is composed of LNA of XLNT oligonucleotides. 

In accordance with a further aspect of the invention, the ligated product containing a 
single label nucleic acid fragment and one or two ligated probe oligonucleotides is purified from 
the reaction mixture by a method selected from the group consisting of gel electrophoresis. 
HPLC, CE, and any combination thereof. 

In accordance with a further aspect of the invention, a second restriction endonuclease 
cleaves the label nucleic acid fragment to generate a different 5' restriction enzyme extension 
cut site, and (a) synthesizing a second primer oligonucleotide with different recognition 
sequence at the 3' end and complementary to a second stitching oligonucleotide; (b) 
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synthesizing a second stitching oligonucleotide having a length of 6 to 50 nucleotides whose 3' 
end is complementary to the 3' end of the first primer oligonucleotide and whose 5' end is 
complementary to the 5' end of the label nucleic acid created; and (c) ligating the two primer- 
linker oligonucleotides and the two stitching oligonucleotides to the labeled label nucleic acid 
fragment. In accordance with yet another aspect of the invention, the ligation product consisting 
of a label nucleic acid fragment and a ligated probe fragment is purified from the reaction by a 
method selected from the group consisting of electrophoresis, HPLC, CE, and any combination 
thereof. In accordance with yet another aspect of the invention, ligation nucleic acid fragment 
and ligated probe fragment are purified prior to ligation to the first and second oligonucleotides, 
and wherein only one of the labeled label nucleic acid fragments is used in the ligation step. In 
accordance with yet another aspect of the invention, ligation nucleic acid fragment and ligated 
probe fragment are purified prior to ligation to the first and second oligonucleotides, and wherein 
both labeled fragments are used separate ligation reactions. 

In accordance with a further aspect of the invention, only one end of the label nucleic 
acid fragment is conducive to ligation. 

A sixth aspect of this invention provides a method for producing a unitized nucleic acid 
probe comprising the acts of: (a) synthesizing a probe oligonucleotide primer having a 5* 
recognition end having a length of about 6 to 50 nucleotides and having a 3' linking end having a 
length of about 6 to 50 nucleotides, wherein the 3' linking end is complementary to the 3* end of 
a stitching oligonucleotide having a length of about 6 to 50 nucleotides; (b) synthesizing the 
stitching oligonucleotide having a 3* end complementary to the 3' end of the linking end and 
having a 5' end complementary to the 5* end of a label nucleic acid; (c) synthesizing the double- 
stranded label nucleic acid having a length of about 25,000 to 50,000 nucleotides and having a 
5' extension; (d) labeling the label nucleic acid with at least one detectable molecule; and (e) 
ligating the probe oligonucleotide, stitching oligonucleotide and label nucleic acid fragments of 
(a) through (d) and DNA ligase under reaction conditions conducive to DNA ligation thereby 
producing a unitized nucleic acid probe. 

A seventh aspect of this invention provides for unitized single stranded nucleic acid 
probe comprising, reading from 5' to 3", a 5' recognition end having a length of between about 6 
to 50 nucleotides, a priming sequence having a length of between about 6 to 50 nucleotides and 
a 3' end comprising an extension product complementary to a template having a length between 
101 and about 10,000 nucleotides. In accordance with a further aspect of the invention, the 
fixed-size template is between 1,000 and 9,000 nucleotides in length. In a preferred 
embodiment, the fixed-size template is between 2,000 and 5,000 nucleotides in length, more 
preferably, the fixed-size template is between 2,500 and 2,700 nucleotides in length. 

In accordance with a further aspect of the invention, substantially all nucleotides 
comprising the unitized transcript are labeled. 

In accordance with a further aspect of the invention,the detectable molecule is selected 
from the group consisting of a dye tag, mass tag, radioactive tag, and any combination thereof. 

-10- 



In accordance with yet another aspect of the invention.the detectable molecule is a dye tag. In 
accordance with yet another aspect of the invention, the unitized transcript is labeled with two or 
more different dye tags. 

In accordance with a further aspect of the invention, the detectable molecule is a mass 
tag. In accordance with yet another aspect of the invention, the unitized transcript is labeled 
with two or more different mass tags. 

In accordance with a further aspect of the invention, the 5' recognition end Is in contact 
with a nucleotide construct selected from the group consisting of LNA, PNA, XLNT. and any 
combination thereof. 

An eighth aspect of this invention provides a unitized substantially double stranded 
nucleic acid probe comprising a single stranded 5' recognition end having a length of between 
about 6 to 50 nucleotides, an overlapping double stranded segment having a length of between 
about 4 to 50 nucleotides and a 3* end comprising a double stranded segment, further 
comprising a template and its complement having a length between 101 and about 10,000 
nucleotides. In accordance with a further aspect of the invention, the overlapping double 
stranded segment comprises at least one modified nucleotide. 

In accordance with a further aspect of the invention, the fixed-size template is between 
1 ,000 and 9,000 nucleotides in length. In a preferred embodiment, the fixed-size template is 
between 2,000 and 5,000 nucleotides in length, and more preferably the fixed-size template is 
between 2,500 and 2,700 nucleotides in length. 

In accordance with a further aspect of the invention, substantially all nucleotides 
comprising the unitized transcript are labeled. In accordance with a further aspect of the 
invention, the detectable molecule is selected from the group consisting of a dye tag, mass tag, 
radioactive tag, and any combination thereof. In accordance with yet another aspect of the 
invention, the detectable molecule is a dye tag. In accordance with yet another aspect of the 
invention, the unitized transcript is labeled with two or more different dye tags. 

In accordance with a further aspect of the invention, the detectable molecule is a mass 
tag. In accordance with yet another aspect of the invention, the unitized transcript is labeled 
with two or more different mass tags. In accordance with a further aspect of the invention, the 5* 
recognition end is in contact with a nucleotide construct selected from the group consisting of 
LNA, PNA, XLNT, and any combination thereof. 

These and other features, aspects and advantages of the present invention will become 
better understood with reference to the following description, examples and appended claims. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 
FIG.1. Evaluation of double-stranded probe constructed in EXAMPLE 1. Electrophoresis of 
probe products on a 0.7% agarose gel. Lnl, Ikb DNA Ladder (500ng); In2, 5 DL Seq. Id. No.1, 
3 & 5 including 0.07mM Biotin-16-dUTP after dye coupling reaction; In3, 5 DL Seq. Id. No. 2, 3 & 
5 after dye coupling reaction; In4, Empty In5, 5 DL Seq. Id. No.1, 3 & 5 including 0.07mM Biotin- 
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16-dUTP after MinElute purification; In6. 5 DL Seq. Id. No. 2, 3 & 5 after MinElute purification; 
In9, Ikb DNA Ladder (500 ng). 

FIG. 2. Evaluation of single-stranded probe constructed in EXAMPLE 2. Electrophoresis of 
probe products on a 0.7% agarose gel. 

FIG. 3. Detection of dye labeled single-stranded probe from EXAMPLE 2 on Single Molecule 
instrument. 

FIG. 4. Evaluation of double-stranded probe constructed in EXAMPLE 3. Electrophoresis of 
probe products on a 5% agarose gel. Ln1 , oligo adapter complex/447bp fragment 
heterodimer (466bp+20ss bases); In2, oligo adapter + 447 bp fragment without ligase; In3, oligo 
adapter and ligase without 447bp fragment; In4, 447bp fragment homodimer (898bp). 

FIG. 5. Detection of dye labeled double-stranded probe from EXAMPLE 3 on Single Molecule 
instrument. 

DETAILED DESCRIPTION OF THE INVENTION 
Abbreviations and Definitions 

To facilitate understanding of the invention, a number of terms and abbreviations as 
used herein are defined below as follows: 

The term "label" refers to a moiety that, when attached to the compositions of the 
invention, render such compositions detectable using known detection means, e.g., 
spectroscopic, photochemical, radioactive, biochemical, immunochemical, enzymatic or 
chemical means. Exemplary labels include but are not limited to fluorophores, chromophores, 
radioisotopes, spin labels, enzyme labels and chemiluminescent labels. Such labels allow direct 
detection of labeled compounds by a suitable detector, e.g., a fluorescence detector. In addition, 
such labels include components of multi-component labeling schemes, e.g., a system in which a 
ligand binds specifically and with high affinity to a detectable anti-ligand, e.g.. a labeled antibody 
binds to its corresponding antigen. 

"Linking group" means a moiety capable of reacting with a "complementary functionality" 
to form a "linkage." A linking group and its associated complementary functionality is referred to 
herein as a "linkage pair." Preferred linkage pairs include a first member selected from the group 
isothiocyanate, sulfonyl chloride, 4,6-dichlorotriazinyl, succinimidyl ester, or other active 
carboxylate, and a second member that is amine. Preferably a first member of a linkage pair is 
maleimide, halo acetyl, or iodoacetamide whenever the second member of the linkage pair is 
sulfhydryl. (e.g., R. Haugland, Molecular Probes Handbook of Fluorescent Probes and Research 
Chemicals, Molecular probes. Inc. (1992)). In a particularly preferred embodiment, the first 
member of a linkage pair is N-hydroxysuccinimidyl (NHS) ester and the second member of the 
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linkage pair is amine, where, to form an NHS ester, a carboxylate moiety is reacted with 
dicyclohexylcarbodiimide and N-hydroxysuccinimide. 

The term "Watson/Crick base-pairing" refers to a pattern of specific pairs of nucleotides, 
and analogs thereof, that bind together through sequence-specific hydrogen-bonds, e.g. A pairs 
with T and U, and G pairs with C. 

The term "nucleoside" refers to a compound comprising a purine, deazapurine, or 
pyrimidine nucleobase, e.g., adenine, guanine, cytosine, uracil, thymine, 7-deazaadenine, 7- 
deazaguanosine, and the like, that is linked to a pentose at the 1 '-position. When the nucleoside 
base is purine or 7-deazapurine, the pentose is attached to the nucleobase at the 9-position of 
the purine or deazapurine, and when the nucleobase is pyrimidine, the pentose is attached to 
the nucleobase at the 1-position of the pyrimidine, (e.g., Komberg and Baker, DNA Replication, 
2nd Ed. (Freeman, San Francisco, 1992)). The term "nucleotide" as used herein refers to a 
phosphate ester of a nucleoside, e.g., a triphosphate ester, wherein the most common site of 
esterification is the hydroxyl group attached to the C-5 position of the pentose. The term 
"nucleoside/tide" as used herein refers to a set of compounds including both nucleosides and 
nucleotides. 

The term "polynucleotide" means polymers of nucleotide monomers, including analogs of 
such polymers, including double and single stranded deoxyribonucleotides, ribonucleotides, 
.alpha.-anomeric forms thereof, and the like. Monomers are linked by "internucleotide linkages," 
e.g., phosphodiester linkages, where as used herein, the term "phosphodiester linkage" refers to 
phosphodiester bonds or bonds including phosphate analogs thereof, including associated 
counterions, e.g., H.sup.+, NH.sub.4.sup.+, Na.sup.+. if such counterions are present. 
Whenever a polynucleotide is represented by a sequence of letters, such as "ATGCCTG," it will 
be understood that the nucleotides are in 5' to 3' order from left to right and that "A" denotes 
deoxyadenosine, "C" denotes deoxycytidine, "G" denotes deoxyguanosine, and 'T" denotes 
deoxythymidine, unless otherwise noted. 

"Analogs" in reference to nucleosides/tides and/or polynucleotides comprise synthetic 
analogs having modified nucleobase portions, modified pentose portions and/or modified 
phosphate portions, and, in the case of polynucleotides, modified internucleotide linkages, as 
described generally elsewhere (e.g., Scheit, Nucleotide Analogs (John Wiley, New York, (1980); 
Englisch, Angew, Chem. Int. Ed. Engl. 30:613-29 (1991); Agrawal, Protocols for Polynucleotides 
and Analogs, Humana Press (1994)). Generally, modified phosphate portions comprise analogs 
of phosphate wherein the phosphorous atom is in the +5 oxidation state and one or more of the 
oxygen atoms is replaced with a non-oxygen moiety, e.g., sulfur. Exemplary phosphate analogs 
include but are not limited to phosphorothioate, phosphorodithioate, phosphoroselenoate, 
phosphorodiselenoate, phosphoroanilothioate, phosphoranilidate, phosphoramidate, 
boronophosphates, including associated counterions, e.g., H.sup.+. NH.sub.4.sup.+, Na.sup.+, if 
such counterions are present. Exemplary modified nucleobase portions include but are not 
limited to 2.6-diaminopurine, hypoxanthine, pseudouridine, C-5-propyne, isocytosine, 
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isoguanine, 2-thiopyrimidine, and other like analogs. Particularly preferred nucleobase analogs 
are iso-C and iso-G nucleobase analogs available from Sulfonics, Inc., Alachua, Fla. (e.g., 
Benner, et aL, U.S. Pat No. 5,432,272). Exemplary modified pentose portions include but are 
not limited to 2 - or 3*-modifications where the 2 - or 3'-position is hydrogen, hydroxy, alkoxy, 
e.g., methoxy, ethoxy, allyloxy, isopropoxy, butoxy, isobutoxy and phenoxy, azido, amino or 
alkylamino, fluoro, chloro, bromo and the like. Modified internucleotide linkages include 
phosphate analogs, analogs having achiral and uncharged intersubunit linkages (e.g., Sterchak, 
E. P., et aL, Organic Chem, 52:4202 (1987)), and uncharged morpholino-based polymers having 
achiral intersubunit linkages (e.g., U.S. Pat No. 5,034,506). A particularly preferred class of 
polynucleotide analogs where a conventional sugar and internucleotide linkage has been 
replaced with a 2-aminoethylglycine amide backbone polymer is peptide nucleic acid (PNA) 
(e.g., Nielsen et al., Science. 254:1497-1500 (1991); Egholm etal.. J. Am. Chem. Soc. 114: 
1895-1897(1992)). 

As used herein the term "primer-extension reagent" means a reagent comprising 
components necessary to effect an enzymatic template-mediated extension of a polynucleotide 
primer. Primer extension reagents include (1) a polymerase enzyme, e.g., a thermostable DNA 
polymerase enzyme such as Taq polymerase; (2) a buffer; (3) one or more chain-extension 
nucleotides, e.g., deoxynucleotide triphosphates, e.g., deoxyguanosine 5 -triphosphate, 7- 
deazadeoxyguanosine 5 -triphosphate, deoxyadenosine 5'-triphosphate, deoxythymidine 5 - 
triphosphate, deoxycytidine 5 -triphosphate; and, optionally in the case of Sanger-type DNA 
sequencing reactions, (4) one or more chain-terminating nucleotides, e.g., dideoxynucleotide 
triphosphates, e.g., dideoxyguanosine 5 -triphosphate, 7-deazadideoxyguanosine 5 - 
triphosphate, dideoxyadenosine 5'-triphosphate, dideoxythymidine 5'-triphosphate, and 
dideoxycytidine 5 -triphosphate. 

"Mobility-dependent analysis technique" means an analysis technique based on 
differential rates of migration between different analyte species. Exemplary mobility-dependent 
analysis techniques include electrophoresis, chromatography, sedimentation, e.g., gradient 
centrifugation, field-flow fractionation, multi-stage extraction techniques and the like. 

A "target nucleic acid" sequence for use with the present invention may be derived from 
any living, or once living, organism, including but not limited to prokaryote, eukaryote, plant 
animal, and virus. The target nucleic acid sequence may originate from a nucleus of a cell, e.g., 
genomic DNA, or may be extranuclear nucleic acid, e.g., plasmid, mitrochondrial nucleic acid, 
various RNAs, and the like. The target nucleic acid sequence may be first reverse-transcribed 
into cDNA if the target nucleic acid is RNA. Furthermore, the target nucleic acid sequence may 
be present in a double stranded or single stranded form. 

A "probe" refers to a biopolymer comprising a recognition (R) moiety and tail (T) moiety 
of the general form R - B - T where B is a bridge connecting the recognition and tail moieties. 
A labeled probe further comprises one or more detectable label moieties covatenty or non- 
covalently attached to or incorporated in the tail. The recognition moiety is a nucleic acid 
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sequence which complementary to a nucleic acid sequence in the target molecule. The tail 
moiety is a defined length polynucleotide which may or may not include labels and/or mass tags. 

Parameter unit. A measure of a parameter which is discernable above the background 
measurement level of the analytical system, and which is associated with a specific, unitized 
probe (molecule), and which is designated as the base unit of measurement for the specific, 
unitized probe for that parameter. A probe may possess more than one type of parameter unit. 
A probe may have parameter units or subunits for more than one parameter. Measurement of 
more than one parameter may be used to discern (identify) a specific unitized probe or probe 
target. Probes detected simultaneously (by measurement of a parameter) can be discriminated 
from one another (or by inference associated with the target molecule) by measuring one or 
more other parameters (parameter units) possessed by one or more of the detected probes. A 
parameter of the target also may be used to distinguish between probe-targets and probes. 
(The parameter unit for a probe also may be tied to a given set of conditions, e.g. velocity in our 
instrument system). 

Fluorescent or luminescent probes of high intensity are of value in detection, particularly for 
single-molecule detection. It also is desirable to have probes of defined size and defined 
fluorescence, again particularly for single-molecule detection, and especially for use in single- 
molecule electrophoresis. In flow cytometry and single-molecule electrophoresis there is a 
limited observation time (frequently on the order of a few milliseconds) for each molecule as it 
passes through a detector's inten-ogation volume. Consequently, it is important to have probes 
that have both specificity and brightness 

Single molecule detection is the ultimate limit for detection. Within the field of single 
molecule detection, optical-based detection systems (i.e. laser-induced fluorescence) are 
generally considered the most desirable option currently available for detection of biochemicals. 
Optical microscopy was used in the first published documentation of single molecule detection 
(Hirschfeld 1976). Since then, the field of optical detection of single molecules has been rapidly 
advanced by a variety of improvements in methodology. 

A typical system used for detection in fluids involves a fluid flow system similar to flow 
cytometry systems in which fluorescence from a sample stream is measured as the stream 
passes a detector. These methods have been applied for sizing of DNA fragments (Ambrose et 
al. 1993; Castro et al. 1993; Goodwin et al. 1993a; Huang et al. 1996; Johnson et al. 1993; 
Larsen et al. 2000; Petty et al. 1995), for progress toward single molecule DNA sequencing 
(Ambrose et al. 1993; Davis et al. 1991; Goodwin et al. 1995; Goodwin et al. 1996; Goodwin et 
al. 1993b; Harding and Keller 1992; Jett et al. 1989; Soper et al. 1991a; Van Orden et al. 2000; 
Wemer et al. 2003), and to detect DNA hybridization via two-color fluorescence (Castro and 
Williams 1997). A variation of the fluid flow technology involves application of an electric field in 
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which molecules move electrophoretically (Castro and Al. 1995; Castro and Shera 1995; Chen 
and Dovichi 1996; Haab and Mathies 1995; Ma et al. 2001; Shortreed et al. 2000; Soper et al. 
1995b; Van Orden and Keller 1998)and which has been utilized for sizing of DNA. 
Electrophoretic single molecule analysis (molecular electrophoresis) also has been used to 
detect protein-DNA interactions via two-color fluorescence (LeCaptain et al. 2001). To date no 
system has combined the differentiating capabilities and single molecule sensitivity of molecular 
electrophoresis for multiplexed hybridization detection. 

Systems capable of single-molecule detection use a single laser or laser beam to detect 
down to single molecules on surfaces or in solution. Fluorescence correlation spectroscopy and 
other confocal instrumentation interrogate minute volumes, consequently have limited molecular 
throughput, and are not suitable for detecting very low levels of analytes (Nie and Zare 1997). 
Atomic Force Microscopy can detect single molecules, but it also suffers from limited molecular 
throughput (Wrotnowski 2002). These systems also have limited potential for multiplexing. Our 
single molecule electrophoresis instrument offers single molecule detection, high specificity, 
high molecular throughput and extensive multiplexing capabilities. 

Single Molecule Detection System 

Our basic method for ultrasensitive detection relies on single-molecule fluorescence 
analysis. A sample to be analyzed is first obtained from air, water, surfaces or tissue. No 
polymerases, enzymes or proteins, or any amplification processes are necessary so sample 
preparation times and complexity are minimal. This single molecule detection instrument 
provides an ultrasensitive means to reliably detect individual molecules. 

The basic detection scheme is shown in Figure 1. The apparatus consists of two lasers 
(or a single laser source split into two beams), focusing light-collection optics, two single photon 
detectors, and detection electronics under computer control. A sample compartment (not shown) 
contains two reservoirs that hold the solution being analyzed. The reservoirs are connected by 
tubing to a glass capillary cell that is square in cross section. 

The heart of the system is the glass capillary flow cell of the apparatus shown in Figure 
2. Two laser beams (5 pm in diameter) are optically focused about 100 pm apart perpendicular 
to the length of the sample-filled capillary tube. The lasers are operated at particular 
wavelengths depending upon the nature of the detection probe to be excited. The interrogation 
volume of the detection system is determined by the diameter of the laser beam and by the 
segment of the laser beam selected by the optics that direct light to the detectors. The 
interrogation volume is set such that, with an appropriate sample concentration, single 
molecules (single nucleic acid probes or single probe-target hybrids) are present in the 
interrogation volume during each time interval over which observations are made. 

Molecules are pumped through the capillary, or an electric field is applied to the sample 
to move molecules electrophoretically. Under electrophoretic conditions, like molecules move 
through the tube in lockstep (plug flow). As molecules pass through each laser beam, excitation 
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of each fluorescent molecule takes place via one-photon excitation. Within a fraction of a 
second, the excited molecule relaxes, emitting a detectable burst of light. The excitation- 
emission cycle is repeated many times by each molecule in the length of time it takes for it to 
pass through the laser beam allowing the instrument to be able to detect hundreds of molecules 
per second. Photons emitted by fluorescent molecules are registered in both detectors with a 
time delay indicative of the time for the molecule (or molecular hybridization complex) to pass 
from the interrogation volume of one detector to the interrogation volume of the second detector. 
The photon intensity recorded by the detectors is then cross-correlated using a personal 
computer with a digital correlator card. The computer then produces a histogram of velocities 
that shows a peak for every fluorescent species present in the sample. When the sample is 
pumped through the capillary, all molecules move at the same velocity. When an electric field is 
applied to the sample, the transit time between the detectors for each molecule is dependent 
upon the molecule's characteristic charge, size and shape (Castro and Al. 1995; Castro and 
Shera 1995; Castro and Williams 1997). An example of the output of the existing laboratory 
system is shown in Figure 3. 

With the current instrument configuration (5 pm laser beam) approximately 0.25% of the 
fluorescent molecules in the solution pass through the laser beams and are detectable. This 
percentage can be increased by configuring each laser beam such that it forms a narrow band 
perpendicular to the length of the capillary (as shown in Figure 2). Such an arrangement can 
raise the percentage of detectable molecules to approximately 5% of the molecules in the 
solution. Other configurations illuminating larger areas of the capillary have been calculated to 
enable detection of up to 50% of the fluorescent molecules present in a sample. 

The present invention describes methods to make defined nucleic acid probes that are either 
double-stranded or single stranded, yet have a short target recognition sequences for each 
probe. The labeling procedures used in the present invention result in probes that are nearly 
identical in length, dye concentration and specificity. Double stranded probes of this format are 
beneficial as they are twice as bright as single stranded probes of the same length. In addition, 
double stranded probes are less likely to have nonspecific binding to target or nontarget nucleic 
acids. 

The inclusion of LNA or XLNT nucleotides into the recognition sequences of these probes 
increases their value for single molecule analysis. LNA probes have the highest recorded 
affinity for target and, or course, crosslinked XLNT nucleotides cannot be separated from target 
without breaking the covalent bond formed by crosslinking. These high affinities and/or stability 
are important for hybridization reactions where analysis occurs at femtomolar (or lower) 
concentrations of target. Standard oligonucleotides in such reactions have lower affinities and 
decrease the number of hybrids that can be detected while increasing the level of probe needed 
for a successful assay. Higher probe levels also cause problems in single-molecule analysis as 
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they result in higher assay backgrounds and make it difficult, if not impossible, to detect probe- 
target hybridization. 

The method of claim 1 generates single-stranded probes of defined length and defined high dye 
concentration that are bright and have short defined recognition sequences for target nucleic 
acids. The method of claim 6 generates double stranded probes of defined length and defined 
high dye concentration that have similar properties to the single-stranded probes. The method 
of claim 12 generates single-stranded probes of defined length and defined high dye 
concentration from PGR generated templates. The method of claim 1 8 generates double 
stranded probes that have one or two target recognition sequences at each end of the dye- 
labeled segment. The target recognition sequences can be different or identical, be for the 
same, or for different targets. 

Methods of Use 

The present invention further encompasses methods of using probes, as well as 
compositions comprising a plurality of probes, and wherein each probe has a distinctive ratio of 
charge to translational frictional drag, to detect and characterize one or more selected 
nucleotide sequences within one or more target nucleic acids. 

In one aspect, the present invention provides a method of detecting a plurality of 
sequences within one or more target nucleic acids, comprising contacting a plurality of , 
wherein each probe has a structure independently , with one or more target nucleic acids, 
generally under conditions that distinguish those that hybridize to the target nucleic acid, and 
detecting those which have hybridized to the target nucleic acid. 

In one aspect of this method, the target nucleic acids are immobilized. In this aspect, the 
immobilized target nucleic acids are contacted with sequence-specific probes, which further 
comprise a detectable label, under conditions that distinguish those probes having sufficient 
homology to hybridize to the target nucleic acid. The non-hybridized probes are washed away 
and hybridized probes are recovered and detected after denaturation of the base-paired 
structure formed between the sequence-specific probe and the immobilized target nucleic acid. 

In another aspect of this method, the target nucleic acid, which may be immobilized, is 
contacted with a plurality of sequence-specific probe probes whereby two probes hybridize to 
adjacent sequences of the target nucleic acid such that the 5'-end of one probe, which generally 
will carry a 5 -phosphate moiety, abuts the 3'-end of the second probe, so that the two probes 
can be covalently joined to one another, in certain embodiments, with a DNA chemical or 
enzymatic ligating activity, to form a ligated product. In this aspect of the method, the ligated 
product is formed by the joining of two probes, at least one of which comprises a detectable 
label and at least one of which is a sequence-specific probe , such that the ligated product has 
a distinctive ratio of charge to translational frictional drag. In a further aspect, three or more 
probes are hybridized to adjacent sequences of a target nucleic acid in such a manner that at 
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least three probes can be covalently joined to form a ligated product, wherein at least one of the 
probes so joined comprises a detectable label, and at one of the probes so joined is a 
sequence-specific such that the ligated product has a distinctive ratio of charge to translational 
frictional drag. Generally, the ligated product, which is hybridized to the target nucleic acid, is 
released by denaturation, and the ligated product having a distinctive ratio of charge to 
translational frictional drag, is detected and analyzed, to provide information about the selected 
nucleotide sequence within the target nucleic acid. 

This cycle of hybridization, joining, and denaturation, may be repeated in order to amplify 
the concentration of the ligated product formed. In this instance, the joining is optionally 
accomplished by means of a thermostable ligating enzyme. These reactions are conveniently 
carried out in thermal cycling machines with thermally stable ligases. 

Furthermore, additional probes , which together are sufficiently complementary to the 
ligated product to hybridize thereto and be covalently joined to one another as above, are also 
included, thereby affording geometric amplification of the ligated product, i. e., a ligase chain 
reaction (Wu, D. Y. and Wallace B. (1989), The ligation amplification reaction (LAR)- 
Amplification of Specific DNA sequences using sequential Rounds of Template Dependent 
Ligation, Genomics 4:560-569; Barany, (1991), Proc. Natl. Acad. Sci. USA. 88:189; Barany, 
(1991). PGR Methods and Applic, 1:5). To suppress unwanted ligation of blunted ended hybrids 
formed between complementary pairs of the and second oligonucleotides and the second pair 
of oligonucleotides, conditions and agents inhibiting blunted ended ligation, for example 200 mM 
NaCI and phosphate, are included in the ligation reaction. 

The product of such a ligase chain reaction therefore is a double stranded molecule 
consisting of two strands, each of which is the product of the joining of at least two sequence- 
specific probes. Accordingly, in yet another aspect of the present invention, at least one of the 
incorporated within the ligase chain reaction product comprises a detectable label, and at one of 
the is a sequence-specific probe such that the ligase chain reaction product has a distinctive 
ratio of charge to translational frictional drag. 

In another aspect of the oligonucleotide ligase assays described above, mismatches, i.e. 
non-complementary nucleobases, existing between selected nucleotide sequences within the 
target nucleic acid and either or both of the sequence-specific probe and the second 
oligonucleotide interfere with the ligation of the two probes either by preventing hybrid formation 
or preventing proper joining of the adjacent terminal nucleotide residues. Thus, when the binding 
conditions are chosen to permit hybridization of both probes despite at least one mismatch, the 
formation of a ligated product reveals the sequence of the selected nucleotide sequence as it 
exists within the target nucleic acid, at least with respect to the terminal, adjacent residues of the 
two probes. Those skilled in the art are well versed in selecting appropriate binding conditions, 
such as cation concentration, temperature, pH, and oligonucleotide composition to selectively 
hybridize the probes to the selected nucleotide sequences within the target nucleic acid. 
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Since the base pairing of terminal adjacent residues affects ligation, in one embodiment 
the probe providing the 3' terminal nucleobase involved in the joining reaction is designed to be 
perfectly complementary to the target sequence while the probe providing the 5' terminal 
nucleobase residue involved in the joining reaction is designed to be perfectly complementary in 
all but the 5* terminal nucleobase. In another embodiment, the probes are designed such that 
the probe providing the 3* terminal nucleobase is perfectly complementary except for the 3' 
terminal nucleobase residue while the oligonucleotide providing the 5* terminal nucleobase is 
perfectly complementary (Wu, D. Y. and Wallace, B.,(1989), Specificity of nick-closing activity of 
bacteriophage T4 DNA ligase, Gene 76: 245-254; Landegren. U. et al. (1988) A ligase mediated 
gene detection technique, Science 2241: 1077-1080). 

In a modification of the method set forth above, the sequence-specific probe comprises 
a nucleobase sequence that is complementary to the target sequence, but comprises a non- 
terminal mismatch with respect to non-target sequences. In this aspect of the invention, the 
composition of the sequence-specific probe and the nature of the experimental conditions are 
such that the probe will only hybridize to the target sequence. In this embodiment for example, a 
second probe that hybridizes to the target nucleobase, either upstream or downstream of the 
hybridized sequence-specific probe, may be ligated to that probe to form the ligated, product 
that is diagnostic of the presence of the target nucleotide sequence. 

In a further modification of the embodiment set forth above, the sequence-specific probe 
is hybridized to a selected nucleotide sequence within a target nucleic acid that is immediately 
adjacent to the site of interest. A second sequence-specific probe is hybridized to the selected 
region within the target nucleic acid such that the hybridized oligonucleotides are separated by a 
gap of at least one nucleotide residue. In another embodiment, the length of the gap is a single 
nucleotide residue representing a single polynucleotide polymorphism in the target nucleic acid. 
Following hybridization, the complex, which consists of the two probes hybridized to the target 
nucleic acid, is treated with a nucleic acid polymerase in the presence of at least one 
deoxyribonucleoside triphosphate. If the deoxyribonucleoside triphosphate(s) provided are 
complementary to the target polynucleotide's nucleotide residues which define the gap, the 
polymerase fills the gap between the two hybridized probes. Subsequent treatment with ligase 
joins the two hybridized oligonucleotides to form a ligated, product, which can, in one 
embodiment, be separated from the template by thermal dissociation, thereby providing a 
diagnostic product having a distinctive ratio of charge to translational frictional drag. This 
diagnostic product will generally comprise a reporter molecule, which may be included within 
either of the ligated probes , be attached to the one or more nucleobases added by the 
polymerizing activity, or be added subsequent to the covalent joining of the probes . By treating 
with polymerase in the presence of fewer than four nucleoside triphosphates, the nucleotide 
residues comprising the gap may be determined. Further amplification of ligated product is 
achieved by repeated cycles of denaturation, annealing, nucleic acid polymerase gap filling, and 
ligation in the presence of at least one of the nucleoside triphosphates. 
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If the treatment with nucleic acid polymerase occurs in the presence of one labeled 
nucleoside triphosphate or a mixture containing one labeled and 3 unlabeled nucleoside 
triphosphates, ligated products comprising at least one incorporated, labeled nucleoside are 
readily detected upon electrophoretic separation of the labeled ligated products. Modification of 
the nucleotide mixture to one having one labeled nucleoside triphosphate and three chain 
terminating nucleoside triphosphates suppresses unwanted ligation of oligonucleotides with 
incorrectly incorporated nucleotide residues. 

Probes of the present invention are also useful as primers for nucleic acid sequence 
analysis by the chain termination method, a method well known to those skilled in the art. In one 
embodiment, probes are hybridized to target nucleic acid and extended by a nucleic acid 
polymerase in the presence of a mixture of nucleoside triphosphates and a chain terminating 
nucleoside triphosphate. The polymerase reaction generates a plurality of chain terminated 
nucleic acids fragments, which are separated, for example by capillary electrophoresis. Chain 
termination by the incorporated chain terminating nucleoside triphosphate identifies the 3' 
terminal residue of the terminated nucleic acid fragment. 

For the purposes of detecting the chain terminated species, various substituents of the 
nucleic acid fragments are amenable to conjugation with detectable reporter molecules. These 
include the nucleoside triphosphate precursors, including the chain terminators, incorporated 
into the nucleic acid. Detectable reporter molecules may be radioactive, chemiluminescent, 
bioluminescent, fluorescent, or ligand molecules. In one embodiment, the detectable label is a 
fluorescent molecule, for example fluorescein isothiocyanate, Texas red, rhodamine, and 
cyanine dyes and derivatives thereof. In another embodiment, the fluorescent dyes are to 
reduce the variations in electrophoretic mobility of nucleic acids caused by the fluorescent label 
(see e.g. Ju, J. et al. (1995). Design and synthesis of fluorescence energy transfer dye-labeled 
primers and their application for DNA sequencing and analysis. Anal. Biochem. 231: 131-40; 
Metzker, et al. (1996) Electrophoretically uniform fluorescent dyes for automated DNA 
sequencing. Science 271: 1420-22; Hung, S. C. et al. (1997) Comparison of fluorescence 
energy transfer primers with different donor-acceptor dye combinations. Anal Biochem. 252: 77- 
88; Tu, O. et al. (1998) The influence of fluorescent dye structure on the electrophoretic mobility 
of end-labeled DNA, Nucleic Acids Res. 26: 2797-2802) 

In one embodiment, the of the present invention are used within a format for 
sequencing selected regions within a target polynucleotide wherein one of four spectrally 
resolvable fluorescent molecules is used to label the nucleic acid fragments in reactions having 
one of four chain terminating nucleoside triphosphates. In another aspect of this embodiment, 
the probes of the present invention are used for sequencing selected regions within a target 
polynucleotide wherein one of four spectrally resolvable fluorescent molecules is used to label 
an oligonucleotide primer in a reaction containing one of four chain terminating nucleoside 
triphosphates. Thus, in both aspects of this embodiment, detecting the fluorescent color of the 
chain terminated nucleic acid fragment identifies the 3' terminal nucleotide residue. Separation 
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of the chain-terminated products by electrophoresis, typically in a single gel lane or capillary, 
along with simultaneous on-line detection of four spectrally resolvable fluorescent molecules 
allows rapid sequence determination from the colors of the separated nucleic acid fragments 
(Prober, J. M. et al. (1985), A System for Rapid DNA Sequencing with Fluorescent Chain 
Terminating Dideoxynucleotides. Science 238: 336-341; Karger, A. E. et al., (1991), 
Multiwavelength Fluorescence Detection for DNA Sequencing Using Capillary Electrophoresis, 
Nucleic Acids Res. 19 (18):4955-62). 

When the nucleotide sequence of interest is a small region of the target nucleic acid, for 
example a site including single nucleotide polymorphism, modified sequencing formats, 
optionally, are used. In one such embodiment, a sequence-specific probe is hybridized in a 
sequence-specific manner such that the 3 -terminal nucleotide residue of the sequence-specific 
probe is immediately adjacent to the site of interest. The hybridized probe is extended by a 
nucleic acid polymerase in the presence of at least one chain terminating nucleoside 
triphosphate extends the oligonucleotide by one nucleotide if the chain terminating nucleotide is 
complementary to the target nucleic residue immediately downstream of the 3*-terminus of the 
hybridized sequence-specific probe . Separation and detection of the extended, sequence- 
specific sequence-specific probe provides the identity of the residue immediately adjacent to 
the hybridized primer. In this embodiment, the use of a plurality of different probe permits the 
simultaneous detection and analysis of a plurality of target sequences in a single separation. 

Detecting the extended primer is accomplished by including a reporter molecule 
conjugated to the extended, sequence-specific probes are used as primers in the same manner 
as described above for standard sequencing reactions. Thus, in one embodiment, the chain 
terminating nucleoside triphosphate is labeled with one of four spectrally resolvable fluorescent 
molecules such that the fluorescent label uniquely identifies the chain terminating nucleotide. 
The composition of the residue immediately adjacent to the hybridized oligonucleotide primer is 
then readily ascertained from the colors of the extended oligonucleotide primer. As will be 
apparent to those skilled in the art, this modified sequencing format is adaptable to other 
mixtures of fluorescently labeled chain terminating nucleoside triphosphates. Thus the 
embodiments encompass nucleotide combinations having two or four chain terminating 
nucleoside triphosphates wherein only one chain terminator is labeled with one of four 
resolvable reporter labels. Mixing the products of the extension reactions, followed by separation 
and detection of the extended products in a single gel lane or capillary provides the ability to 
determine all possible sequence variations at the nucleotide residue adjacent to the hybridized 
primer. Further increase in sensitivity of the methods are possible by using substantially 
exonuclease-resistant chain terminators, such as those which form thio-ester internucleotide 
linkages, to reduce removal of incorporated chain terminators by polymerase associated 
exonuclease. 

In another embodiment, are used in polymerase chain reactions (PCR) to detect and 
amplify selected nucleotides within one or more target nucleic acids (Mullis, K.. U.S. Pat. No. 
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4,683.202; Saiki. R. K., et al., Enzymatic Amplification of .beta.-Globin Genomic Sequences and 
Restriction Site Analysis for Diagnosis of Sickle Cell Anemia, In PGR: A practical approach, M. 
J. McPherson, P. Quirke, and G. R. Taylor, Eds., Oxford University Press, 1991). In this aspect 
of the present invention, the detection method involves PGR amplification of nucleotide 
sequences within the target nucleic acid. In this aspect, a target nucleic acid, which may be 
immobilized, is contacted with a plurality of , two of which hybridize to complementary strands, 
and at opposite ends, of a nucleotide sequence within the target nucleic acid. Repeated cycles 
of extension of the hybridized sequence-specific oligonucleotides, optionally by a thermo- 
tolerant polymerase, thermal denaturation and dissociation of the extended product, and 
annealing, provide a geometric expansion of the region bracketed by the two probes. The 
product of such a polymerase chain reaction therefore is a double-stranded molecule consisting 
of two strands, each of which comprises a sequence-specific probe. In this aspect of the present 
invention, at least one of the sequence-specific oligonucleotides is a sequence-specific probe 
such that the double stranded polymerase chain reaction product has a distinctive ratio of 
charge to translational frictional drag. The polymerase chain reaction product formed in this 
aspect of the invention further comprises a label, which may be incorporated within either of the 
sequence-specific probes used as primers, or it may be incorporated within the substrate 
deoxyribonucleoside triphosphates used by the polymerizing enzyme. In yet another aspect, the 
polymerase chain reaction product formed is analyzed under denaturing conditions, providing 
separated single stranded products. In this aspect, at least one of the single stranded products 
comprises both a label and a sequence-specific primer such that the single-stranded product 
derived from double stranded polymerase chain reaction product has a distinctive ratio of charge 
to translational frictional drag. As is well known in the art, such a single-stranded product may 
also be generated by carrying out the PGR reaction with limiting amounts of one of the two 
sequence-specific probes used as a primer. By using distinctive sequence-specific nucleic 
acids or probes as primers, the PGR reaction can detect many selected regions within one or 
more target polynucleotides in a single assay by allowing separation of one PGR product from 
another. Moreover, those skilled in the art will recognize that using various combinations of 
primers provides additional ways to generate distinctive PGR products. For example, a 
combination of a probe and a second primer pair in the PGR reaction generates a PGR product 
with a single strand. On the other hand, a combination of a probe and a second probe, which is 
also mobility-modified, generates a PGR product having both strands that are mobility-modified, 
thus distinguishing itself from the PGR product with one strand. Thus, by varying the type of 
mobility-modifying group and the nucleic acid strands that are mobility-modified, the 
embodiments enlarge the capacity to detect multiple target segments. 

Detection of the PGR products may be accomplished either during electrophoretic 
separation or after an electrophoretic separation. Intercalating dyes such as ethidium bromide, 
ethidium bromide dimers, SYBR.RTM. Green, or cyanine dye dimers such as TOTO. YOYO and 
BOBO are available for post separation detection (Haugland, R. P, Handbook of Fluorescent 

-23- 



Probes and Research Chemicals, S.sup.th ed, Molecular Probes. Inc.. 1996). Alternatively, the 
PGR products further comprise reporter molecules, including but not limited to radioactive, 
chemiluminescent, bioluminescent, fluorescent, or ligand molecules that permit detection either 
during or subsequent to an electrophoretic separation. Methods for labeling the PGR products 
follow the general schemes presented for labeling in other methods described infra. 

Detecting a selected nucleotide sequence within a target nucleic acid by PGR 
amplification also encompasses identifying sequence variations within segments of the target 
nucleic acid. These variations include, among others, single nucleotide polymorphisms and 
polymorphisms in variable nucleotide tandem repeats (VNTR) and short tandem repeats (STR), 
such as those defined by sequence tag sites (STS). Identifying polymorphic loci are of particular 
interest because they are often genetic markers for disease susceptibility (see e.g. Gastier, J. 
M., (1995), Hum Mol Genet, 4(10): 1829-36; Kimpton, G. P., (1993), Automated DNA profiling 
employing multiplex amplification of short tandem repeat loci, PGR Methods Appl., 3(1): 13-22). 
If the polymorphisms relate to variations in VNTR or STR sequences, direct analysis of PGR 
products without further treatment suffices for detecting polymorphisms since the products differ 
in nucleotide length. The presence of PGR products, however, expands the capability of the 
PGR analysis to detect multiple polymorphic loci in a single reaction. 

If the polymorphisms relate to single nucleotide differences, the variations are detectable 
by conducting PGR reactions using primers designed to have mismatches with the selected 
nucleic acid sequence within a target nucleic acid. The presence of intentional mismatches 
within the duplex formed by hybridization of the primer and the selected nucleic acid sequence 
within the target nucleic acid affects the thermal stability of those duplex molecules, which is 
reflected in the T.sub.m of those structures and thus, under selected conditions, results in 
preferential amplification of one target segment as compared to another. Such allele-specific 
polymerase chain reactions permit identification of mutations in single cells, or tissues 
containing a low copy number of one selected nucleotide sequences amongst a high 
background of other nucleotide seqeunces within one or more target nucleic acids (Gha, R. S., 
(1993), Mismatch amplification mutation assay (MAMA): application to the c-H-ras gene., PGR 
Methods Appl., 2(1) 14-20; Glaab., W. E. et al., (1999), A novel assay for allelic discrimination 
that combines fluorogenic 5' nuclease polymerase chain (TaqMan.RTM.) and mismatch 
amplification mutation., Mutat. Res. 430: 1-12). 

In yet another aspect, single nucleotide differences are distinguished through analysis of 
higher order conformations of single stranded nucleic acids that form in a sequence dependent 
manner. In this embodiment, single stranded nucleic acids are generated by dissociating the 
PGR products into single strands, or by preferentially amplifying one strand by using limiting 
amounts of one primer in the PGR reaction (ie. single-sided PGR). Under selected conditions, 
the single stranded nucleic acids are allowed to form higher order structures by intramolecular 
hydrogen bonding of the single stranded nucleic acid. Those skilled in the art are well versed in 
defining such permissive conditions (i.e. temperature, denaturant concentration, pH, cation 
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concentration etc.) for forming the higher order structures. These conformations, which are 
sequence dependent and which therefore can be extremely sensitive to single nucleotide 
changes, affect the electrophoretic mobility of the nucleic acid, and thus reveal variation in a 
selected nucleotide sequence within a target nucleic acid by their unique electrophoretic mobility 
profiles. To enhance formation of higher order structures modifications are introduced into the 
primers used for the PGR reactions. For example, a primer is engineered with additional bases 
complementary to a part of the selected nucleotide sequence within a target nucleic acid 
containing the sequence variation, such that higher order conformations form when the 
additional bases on the primer "snapback" or re-anneal to the normal sequence but not to 
variant sequences (Wilton, S. D. (1998), Snapback SSCP analysis: engineered conformation 
changes for the rapid typing of known mutations, Hum. Mutat. 11 (3): 252-8). Since the reliability 
of detecting single nucleotide variations is affected by size of the single stranded probe , 
conformation analysis using , with each having a distinctive ratio of charge to translationai 
frictional drag, permits detection of a plurality of selected nucleotide sequences within a target 
nucleic acid while maintaining the optimal length needed for forming higher order structures 
(Sheffield, V. C, (1993), The sensitivity of single stranded conformation polymorphism analysis 
for the detection of single base substitutions. Genomics 16 (2): 325-32). 

In yet another aspect of the invention, probes are cleaved to detect selected nucleotide 
sequences within one or more target nucleic acids. In one such embodiment, probes are 
hybridized to selected nucleotide sequences within one or more target nucleic acids. In another 
embodiment, PGR products comprising at least one sequence-specific probe serve as 
substrates for sequence-specific enzymes, such as restriction enzymes. Digestion of the 
substrates by the enzymes creates cleaved products having a distinctive ratio of charge to 
translationai frictional drag, which provides information about sequence composition of the 
target polynucleotides. This form of restriction fragment length polymorphism (RFLP) analysis is 
well known to those skilled in the art (see e.g., Kidd, 1. M., (1998), A multiplex PGR assay for the 
simultaneous detection of human herpesvirus 6 and human herpesvirus 7, with typing of HHV-6 
by enzyme cleavage of PGR products, J. Virol. Methods 70 (1): 29-36; Gelernter, J., (1991), 
Sequence tagged sites (STS) Taq I RFLP at dopamine beta-hydroxylase, Nucleic Acids. Res. 
19(8): 1957). 

In another aspect, probes hybridized to selected nucleotide sequences within a target 
nucleic acid, wherein there is at least one nucleobase not complementary to the corresponding 
nucleobase in the target nucleic acid, are treated with agents that specifically cleave the non- 
base-paired nucleotide residues. Generally, the unpaired residue occurs on the hybridized probe 
(Bhattacharya, etal., (1989), Nucleic Acids. Res. 17. 6821-6840). Although chromosomal DNA 
may serve as the target nucleic acids, target nucleic acids are cloned DNA fragments 
comprising selected nucleotide sequences of a target nucleic acid, or PGR amplification 
products comprising selected nucleotide sequences of a target nucleic acid. 
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Cleavage may be accomplished with either chemical or enzymatic reagents. In chemical 
cleavage reactions, the hybrids containing at least one non-complementary nucleobase, are 
treated with chemicals which specifically modify the unpaired residue, rendering the 
internucleotide linkage of the modified nucleoside susceptible to hydrolysis. Suitable chemical 
agents include but are not limited to carbodiimide, osmium tetraoxide, hydroxylamine or 
potassium permanganate/tetraethylammonium chloride (Ellis, T. P., et al., (1998), Chemical 
cleavage of mismatch: a new look at an established method. Hum Mutat. 1 1 : 345-53; Roberts, 
E., (1997), Potassium permanganate and tetraethylammonium chloride are safe and effective 
substitute for osmium tetraoxide in solid phase fluorescent chemical cleavage mismatch, Nucleic 
Acids. Res. 25: 3377-78). The use of potassium permanganate/tetraethylammonium chloride 
rather than osmium tetraoxide enhances cleavage at T/G mismatched pairs. 

Enzymatic cleaving reagents encompass a variety of nucleases which recognize 
unpaired regions. These include but are not limited to single stranded specific nucleases such 
as SI nuclease from Aspergillus oryzue, PI from Penicillum citrinum, and mung bean nuclease 
(Shenk, et al., (1975) Proc. Natl. Acad. Sci. USA 72 989-93). Although these nucleases are less 
reactive towards single nucleotide mismatches, they can digest unpaired residues created by 
longer insertions and deletions (Dodgson, J. B. et al., (1977), Action of single-stranded specific 
nucleases on model DNA heteroduplexes of defined size and sequence, Biochemistry, 16:2374- 
49). Cel 1 and SP endonucleases show activity toward unpaired nucleotide residues resulting 
from nucleotides sequence variations comprising deletions, insertions, and missense mutations, 
within selected nucleotide sequences of target nucleic acids. (Oleykowski, C. A., (1998) 
Mutation detection using a novel plant endonuclease. Nucleic Acids. Res. 26: 4597-602; Yeung, 
A. T.,U.S. Pat. No. 5,869,245). Resolvases from various sources, such as bacteriophage and 
yeast, represent yet another class of cleaving enzymes useful in this embodiment of the 
invention. Representative examples of resolvases include but are not limited to phage encoded 
T4 endonuclease VII and T7 endonuclease I, both of which cleave at mismatches (Cotton, R. G. 
H., U.S. Pat. No. 5,958,692; Solaro, et al, (1993), Endonuclease VII of Phage T4 Triggers 
Mismatch Correction in vitro. J. Mol. Biol. 230: 868-877; (Chang, D. Y. et al., (1991), Base 
mismatch specific endonuclease activity in extracts from Saccharomyces cerevisiae; Nucleic 
Acids Research 19 (17): 4761-66). 

In another aspect, of the present invention encompasses methods that prevent cleavage 
at unpaired residues. Proteins, including but not limited to the MutS protein of E. coli., bind to 
sites of single nucleotide mismatches in duplex nucleic acid structures (Su, S. S. et al., (1986), 
Escherichia coli mutS encoded protein binds to mismatched DNA base pairs, Proc. Natl Acad. 
Sci. USA 83: 5057-5061). The MutS protein is part of the methylation directed E coli. MutH/S/L 
mismatch repair system, homologs of which are present in other bacteria, yeast and mammals 
(Eisen, J. A., (1998), A phylogenetic study of the MutS family of proteins, Nucleic Acids. Res. 
26: 4291-300; Alani, E. (1996), The Saccharomyces cerevisiae Msh2 and Msh6 proteins form a 
complex that specifically binds to duplex oligonucleotides containing mismatched DNA base 
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pairs, Mol. Cell Biol. 16: 5604-15; Modrich, P. etal., (1996), Mismatch repair in replication 
fidelity, genetic recombination and cancer biology, Annu. Rev. Biochem. 65: 101-33). Therefore 
in one embodiment of the invention, duplex structures comprising at least one non-base-paired 
nucleobase unit formed by hybridization of a sequence-specific probe with a selected 
nucleotide sequence within a target nucleic acid, are treated with mismatch binding proteins 
such as MutS and then exposed to one or more exonucleases which degrade the duplex 
strands in a unidirectional fashion. A bound mismatch binding protein inhibits further action of 
the exonuclease on the strand containing the mismatch, thereby providing nucleic acid products 
of defined length and which possess a distinctive ratio of charge to translational frictional drag 
(Ellis, L. A., (1994), MutS binding protects heteroduplex DNA from exonuclease digestion in 
vitro: a simple method for detecting mutations. Nucleic Acids Res. 22 (13):2710-1; Taylor, G. R., 
U.S. Pat. No. 5,919,623). Unidirectional exonucleases suitable for use in this assay include, but 
are not limited to exonuclease III, bacteriophage .lambda, exonuclease, and the 3' to 5* 
exonucleases of T7 DNA polymerase, T4 DNA polymerase, and Vent (tm) DNA polymerase. 

In yet another aspect, a sequence-specific probe is used in a cleavage based method 
of detecting selected nucleotide sequences within a target nucleic acid may be a DNA-RNA- 
DNA probe , where an internal RNA segment is flanked by DNA segments. This tripartite 
sequence-specific probe is hybridized to a selected nucleotide sequence within a target nucleic 
acid at a temperature below the Tm of the overall, i.e. tripartite probe. Digestion of this duplex 
structure with an appropriate RNase, hydrolyzes only the RNA portion of the DNA-RNA-DNA 
probe when hybridized to a DNA template. In one embodiment, the RNase is a thermo-stable 
RNase H (Bekkaoui, F., (1996), Cycling probe technology with RNase H attached to an 
oligonucleotide, Biotechniques, 20 (2): 240-8). If the temperature of the reaction maintained 
above the Tm of the flanking DNA segments remaining after digestion of the internal RNA 
segment, those DNA segments dissociate, thus allowing another DNA-RNA-DNA oligomer to 
associate with the target polynucleotide. Repeated hybridization, RNA cleavage, and 
dissociation of the flanking DNA segments amplifies the level of detectable dissociated DNA 
segments. The reaction temperature, in one embodiment, is held constant during the 
amplification process, thus obviating any need for thermal cycling (Duck, P. (1990), Probe 
amplifier system based on chimeric cycling oligonucleotides, Biotechniques 9: 142-48; 
Modrusan, Z. (1998) Spermine-mediated improvement of cycling probe reaction, Mol. Cell 
Probes 12: 107-16). In this aspect of the present invention, the sequence-specific DNA-RNA- 
DNA probe used comprise at least one mobility-modifying polymer and at least one reporter 
molecule attached to either or both of the flanking DNA segments, thereby providing a labeled 
digestion product having a distinctive ratio of charge to translational frictional drag. 

In another aspect, detection of selected nucleotide sequences within one or more target 
nucleic acids based on cleavage of a probe relies upon cleavage substrates formed by invasive 
hybridization, as described in Brow et al. U.S. Pat. No. 5.846,717, In this embodiment, the 5- 
portion of a sequence-specific probe, which comprises a reporter molecule and which is 
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hybridized to a target nucleic acid, is displaced by a second probe that hybridizes to the same 
region and thereby exposing that displaced sequence to cleavage with a cleaving reagent. In 
practicing the embodiment, the target nucleic acid is contacted with a sequence-specific probe 
and with a second probe. The sequence-specific probe has a 5 -segment complementary to a 
second region of the selected nucleotide sequence contained within a target nucleic acid and a 
3 -portion complementary to a third region of the selected nucleotide sequence contained within 
a target nucleic acid, wherein the second region is downstream from the third region. The 
second probe has a 5 -segment complementary to a first region of the selected nucleotide 
sequence contained within a target nucleic acid and a 3'-segment complementary to the second 
region of the selected nucleotide sequence contained within a target nucleic acid, wherein the 
first region is downstream from the second region. Under selected conditions, hybrids form in 
which the sequenced specific probe and the second probe hybridize to the target polynucleotide 
such that the second probe displaces the 5* portion of the hybridized sequence-specific probe, 
whereas the 3' portion of the sequence-specific probe and the 5' portion of the second probe 
remain annealed to the selected nucleotide sequence contained within a target nucleic acid. The 
displaced strand, which is a single stranded segment that is not base-paired corresponds to the 
5*-end of the sequence-specific probe then serves as a substrate for cleavage nucleases, thus 
producing discrete digestion products having distinct ratios of charge to translational frictional 
drag that reflect presence of specific sequences on the target polynucleotide. 

Cleaving enzymes recognizing displaced strands are either naturally occurring nucleases 
or modified nucleases. Naturally occurring structure-specific nucleases include, but are not 
limited to Pyrococcus woesii FEN-1 endonuclease, thermostable Methoanococcus jannaschii 
FEN-1 endonucleases, yeast Rad2, and yeast Radl/RadIO complex (Kaiser et al., U.S. Pat. No. 
6,090,606, Cleavage Reagents; Kaiser, et al. U.S. Pat. No. 5,843,669, Cleavage of nucleic acid 
using thermostable Methoanococcus jannaschii FEN-1 endonucleases). Other structure-specific 
enzymes suitable for the cleaving reaction are those derived from modifications of known 
nucleases and polymerases (Dahlberg et al., U.S. Pat. No. 5795763, Synthesis Deficient 
Thermostable DNA Polymerase; Dahlberg et al., U.S. Pat. No. 6,614,402, 5' Nucleases Derived 
from Thermostable DNA Polymerase). Modified polymerase that lack polymerase activity but still 
retain 5'-nuclease activity, are also used as cleaving reagents. 

Another embodiment is directed toward the use of the mobility-modifyfing polymers of 
the present invention in "invader assays," which are SNP-identifying procedures based upon 
flap endonuclease cleavage of structures formed by two overiapping probes that hybridize to a 
target nucleic acid (see e.g. Cooksey et al., 2000, Antimicrobial Agents and Chemotherapy 44: 
1296-1301). Such cleavage reactions release products corresponding to the 5 -terminal 
nucleobase(s) of the "downstream" probe. Where those cleavage products are labeled and can 
be separated from the uncleaved probe, an invader assay can be used to discriminate single 
base differences in, for example, genomic sequences or PCR-amplified genomic sequences. 
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Attachment of the mobility-modifying polymers of the present invention to the labeled 5- 
terminus of the downstream probe used in an invader assay provides detectably-labeled 
cleavage products with distinctive charge to translational frictional drag ratios. Accordingly, a 
plurality of SNP's are analyzed simultaneously using a plurality of sequence-specific 
downstream probes , wherein the sequence-specific downstream probes comprise a mobility- 
modifying polymer of the present invention attached to the labeled 5 -terminus, such that the 
labeled product generated by flap endonuclease cleavage at each SNP has a distinctive charge 
to translational frictional drag ratio. 

In a further aspect of the invader assay, for example, the downstream probe , which 
carries a label and a first mobility-modifying polymer of the present invention attached to the 5'- 
terminus, further comprises a second mobility-modifying polymer attached to the 3*-terminus. 
The presence of the second mobility-modifying polymer increases the sensitivity of the invader 
assay by enhancing the difference between the electrophoretic mobility of the flap endonuclease 
generated product, comprising the 5 -terminus, label, and first mobility-modifying polymer, and 
the electrophoretic mobility of the uncleaved downstream probe. Accordingly, the second 
mobility-modifying polymer has a molecular weight of at least 2000. In other embodiments, the 
second mobility-modifying polymer has a molecular weight of at least 5,000, at least 10,000, at 
least 20,000, and at least 100,000. In one embodiment, the second mobility-modifying polymer 
is a mobility-modifying polymer of the present invention, while in other embodiments, the second 
mobility-modifying polymer is a mobility-modifying polymer of the art, which is, in one illustrative, 
non-limiting example, an uncharged mono methyl polyethyleneglycol polymer. Moreover, the 
second mobility-modifying polymer may comprise a mixture of species of different molecular 
weight, provided that those species do not interfere substantially with detection of the signal 
product, i.e., the flap endonuclease generated product, comprising the 5 -terminus, label, and 
first mobility-modifying polymer (see Example 5, below). 

More generally, in other embodiments of the present invention, invader assays are 
performed in which the downstream oligonucleobase polymer comprises a label and a mobility- 
modifying polymer of the present invention attached to a first region of the downstream 
oligonucleobase polymer, and a second, high-molecular weight mobility-modifying polymer 
attached to a second region of the downstream oligonucleobase polymer, wherein first and 
second regions are separated by the flap endonuclease cleavage site. One aspect of this 
embodiment is described above and in Example 5, wherein the label and mobility-modifying 
polymer of the present invention are attached to the 5'-end of the sequence-specific 
oligonucleobase polymer and a second, high molecular weight mobility-modifying polymer is 
attached to the 3 -end of the sequence-specific oligonucleobase polymer. In other embodiments, 
for example, a second, high molecular weight mobility-modifying polymer is attached, via a linker 
arm nucleotide residue, to the sequence-specific probe , rather than at the 5 -end or 3 -end of 
the sequence-specific probe . Accordingly, the second, high molecular weight mobility-modifying 
polymer, is attached at any nucleobase residue within the second region of the downstream 
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probe , or to the 5 -end or 3*-end, whichever is included within the second region of the 
downstream oligonucieobase polymer. Similarly, in some embodiments, the label, which is a 
fluorescent dye in certain non-limiting examples, is also attached via a linker arm nucleotide 
residue at any nucleobase residue within the first region of the downstream probe. Synthesis of 
such linker arm nucleotides and the coupling of, inter alia, a fluorescent dye or an uncharged 
mono methyl polyethyleneglycol polymer to the linker, are within the scope of the art (see e.g., 
Section 4.5 above). Moreover, e.g., linker arm nucleoside phosphoramidite monomers, as well 
as linker arm nucleoside phosphoramidite monomers comprising flourescent moieties, are 
commercially available (Glen Research, Inc., Sterling, Va.). In these embodiments, the mobility- 
modifying polymer of the present invention is attached to the first region of the downstream 
probe, where the point of attachment may be at the 5'-end or the 3'-end, whichever is 
encompassed within the first region of the downstream probe , or the mobility-modifying polymer 
of the present invention may be incorporated within the first region of the downstream probe , 
providing a molecule according to Structural formula (IV). Therefore, in each of these 
embodiments, the presence of the second high molecular weight mobility-modifying polymer 
attached to the second region of the downstream probe increases the sensitivity of the invader 
assay by enhancing the difference between the electrophoretic mobility of the flap endonuclease 
generated product comprising a label and a mobility-modifying polymer of the present invention, 
i.e., the first region of the downstream oligonucieobase polymer, and the electrophoretic mobility 
of the uncleaved downstream probe . 

In a still further embodiment of an invader assay, the downstream probe carries a label 
and a first mobility-modifying polymer, which is in one non-limiting embodiment, a standard PEO 
mobility-modifying polymer of the art, that is attached to the first region of the downstream probe 
, and a second, high molecular weight mobility-modifying polymer attached to the second region 
of the downstream probe . As above, the presence of the second mobility-modifying polymer 
increases the sensitivity of the invader assay by enhancing the difference between the 
electrophoretic mobility of the flap endonuclease generated product, i.e., the first region of the 
donwstream probe , which comprises a label and a first mobility-modifying polymer, and the 
electrophoretic mobility of the uncleaved downstream probe . Accordingly, the second mobility- 
modifying polymer has a molecular weight of at least 2000. In other embodiments, the second 
mobility-modifying polymer has a molecular weight of at least 5,000, at least 10,000, at least 
20,000, and at least 100,000. In one embodiment, the second mobility-modifying polymer is a 
mobility-modifying polymer of the present invention, while in other embodiments, the second 
mobility-modifying polymer is a mobility-modifying polymer of the art, which is, in one illustrative, 
non-limiting example, an uncharged mono methyl polyethyleneglycol polymer. 

In another aspect of the present invention, the sequence-specific probe serves as a 
cleavage substrate in detection reactions involving multiple sequential cleavage reactions, as 
described in Hall. J. G. et al., U.S. Pat. No. 5,994,069. In this embodiment, a first cleavage 
structure is formed as set forth above, except that in the present embodiment, the first probe is 
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optionally a sequence-specific probe. The reaction mixture further includes a second target 
nucleic acid and a third probe, which is a sequence-specific probe, and further comprises at 
least one attached reporter molecule. The second target polynucleotide has a first, a second 
and a third region, wherein the first region is downstream of the second region, and the second 
region is downstream of the third region. The third probe has a 5' portion fully complementary to 
the second region of the second target polynucleotide and a 3* portion fully complementary to 
the third region of the second target polynucleotide. Treatment of the first cleavage structure 
results in release of a fourth probe, which has a 5* portion complementary to the first region of 
the second target polynucleotide and a 3' portion fully complementary to the second region of 
the second target polynucleotide. This released fourth probe forms a cleavage structure with the 
second target polynucleotide and the third probe under conditions where the 3* portion of the 
third probe and the 5' portion of the fourth probe remains annealed to the second target 
polynucleotide. Cleavage of the third probe with a cleavage reagent generates a fifth and sixth 
probe, either or both of which comprise a reporter molecule and a mobility-modifying polymer, 
thereby providing a digestion product having a distinctive ratio of charge to translational frictional 
drag. The fifth probe is released upon cleavage, while the sixth probe remains hybridized to the 
second target polynucleotide until dissociated by denaturation. Subsequent separation and 
detection of the fifth or sixth probe provides information about the presence of the first and 
second selected nucleotide sequence within the target nucleic acid. 

In a further aspect of the present invention relating to a nucleotide sequence detection 
method involving multiple sequential cleavage reactions, a first cleavage structure is formed by 
first and second probe and a selected nucleotide sequence within a target nucleic acid, as set 
forth above. This aspect of the method further comprises a sequence-specific second target 
probe, which has a first, a second, and a third region, wherein the first region is downstream of 
the second region, and wherein the third region upstream of the second region, is fully self 
complementary and also complementary to the second region, such that it forms a hairpin 
structure under selected conditions. Cleavage of the first cleavage structure with a cleaving 
reagent generates a fourth probe , which has a 5'-portion complementary to the first region and 
a 3 -portion fully complementary to the second region of the probe . Hybridization of the released 
fourth probe to the first and second regions of the sequence-specific probe forms a second 
cleavage structure with a displaced third region that is complementary to the second region. 
Cleavage of this second cleavage structure generates a fifth and sixth probes , either of which 
comprises a mobility-modifying polymer and a label, thereby providing s digestion product 
having a distinctive ratio of charge to translational frictional drag, and whose separation and 
detection provides information about the presence of the first target nucleic acid and the second 
probe . 

Methods for labeling and detecting the cleaved probes, as set forth infra., are equally 
applicable to the labeling and detection of products of the cleavage reactions. Moreover, 
labeling of released cleavage products is also accomplished by extension of the product by 
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template independent polymerases, including but not limited to terminal transferase and polyA 
polymerase as described in U.S. Pat. No. 6.090,606, which is hereby specifically incorporated 
by reference. 

In yet another aspect, the probes of the present invention are employed within a general 
method to effect the electrophoretic analysis and/or separation of target nucleic acids of identical 
or different sizes in non-sieving media. Normally, nucleic acids of different length (i.e. consisting 
of different numbers of nucleobase residues) display an essentially invariant ratio of charge to 
translational frictional drag. Accordingly, such molecules cannot be separated electrophoretically 
in non-sieving media. However, attachment of a sequence-specific probe of the present 
invention to target nucleic acids of identical or different length alters their ratio of charge to 
translational frictional drag of the target nucleic acids in a manner and to a degree sufficient to 
effect their electrophoretic mobility and separation in non-sieving media. Furthermore, and in 
contrast to electrophoretic separations in sieving media, longer nucleic acids to which a 
sequence-specific probe of the present invention has been attached will migrate more rapidly 
than a shorter nucleic acid to which the same sequence-specific probe has been attached. 
Applicants believe, although without wishing to be held to that belief, that such separations are 
based upon the proportionately smaller effect of attachment of a mobility-modifying sequence- 
specific probe of defined mass and size to a longer chain nucleic acid molecule than to a shorter 
chain nucleic acid molecule. Consequently, the ratio of charge to frictional translational drag will 
be greater for the longer chain, providing the longer chain nucleic acid with a greater velocity in 
an electric field. 

In yet a further aspect, to affect the electrophoretic analysis and/or separation of target 
nucleic acids sieving media can be employed. Attachment of a sequence-specific probe of the 
present invention to target nucleic acids of identical or different length alters their ratio of charge 
to translational frictional drag of the target nucleic acids in a manner and to a degree sufficient to 
effect their electrophoretic mobility and separation in sieving media providing additional 
emphasis on the size of the probe -target complex over any net charge affects of the 
association. 

Attachment of a to a population of nucleic acids of different length can be 
accomplished using a variety of approaches, including but not limited to hybridization, 
crosslinking of hybridization complexes, enzymatic ligation or direct, synthetic incorporation of 
the mobility-modifying of the present invention into the population of nucleic acids of different 
lengths that are to be separated. 

In one aspect of this method, a mobility-modifying sequence-specific probe is 
enzymatically ligated to a population of nucleic acids of different length but having a common 
nucleotide sequence at the 5*-end, as is seen within the products of a chain termination nucleic 
acid sequencing reaction or, effectively, in chemical cleavage sequencing reactions which are 
transparent to all sequences other than those comprising the labeled 5'-end of the nucleic acid 
substrate. In this embodiment a synthetic template oligonucleotide, having two distinct sequence 
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regions would be used as a template to align the hybridized 3'-end of a mobility-modifying 
sequence-specific probe so that it would directly abut the hybridized 5 -end, which is generally 
phosphorylated, that is common to the population of nucleic acids to be separated, and permit 
the two molecules to be covalently joined. Therefore the 5'-region of the synthetic template 
oligonucleotide would consist of a nucleotide sequence complementary to the common 5 -end 
sequence of the molecules to be separated, while the 3 -region of the synthetic template would 
consist of sequences complementary to the 3 -end of the mobility-modifying sequence-specific 
probe to be joined. In another embodiment of this approach, the common 5 -end of the 
population of nucleic acids to be separated corresponds to that generated by a sequence- 
specific restriction endonuclease. Therefore the synthetic template nucleic acid consists of at 
least eight nucleobases, of which at least three would be complementary to a common 5*- 
sequence of the population of molecules to be separated. The design of such template nucleic 
acids, as well as the conditions under which the enzymatic joining of the hybridized target 
nucleic acid and the sequence-specific probe would be carried out, are well known to those of 
ordinary skill in the art. Accordingly, this embodiment of the invention is applicable to any 
population of molecules of different sizes, provided each has a common 5'-end sequence of at 
least three nucleotides, in certain embodiments, at least four nucleotides, and in further 
embodiments, at least eight nucleotides. Similar procedures, wherein the sequence common to 
a population of molecules of different sizes occurs at the 3 -end, and consequently, the mobility- 
modifying sequence-specific nucleic to be attached has a phosphorylated 5 -end with the 
mobility-modifying polymer attached to the 3 -end, are also included within the scope of the 
present invention. 

In a further embodiment, a mobility-modifying sequence-specific probe is synthesized or 
produced so as to be complementary to a nucleotide sequence within, for example, a 
sequencing vector, that is upstream of, i.e. toward the 5 -end of, the binding site of a sequencing 
primer used in Sanger, enzymatic chain termination sequencing reaction. In this embodiment, 
the sequence-specific probe is enzymatically ligated to the sequencing primer either before or 
after extension of the sequencing primer during a chain termination sequencing reaction. In this 
embodiment, the sequence-specific nucleic acid is synthesized so that, once hybridized to the 
template polynucleotide, its 3*-end would either directly abut the 5'-end of the hybridized 
sequencing primer, or that 3*-end would hybridize to sequences upstream of the 5 -end of the 
sequencing primer. In the latter instance, the resulting gap is filled with a nucleic acid 
polymerase and the extended molecule is then enzymatically ligated to the sequencing primer. 

Another embodiment of the invention is related to the separation and/or analysis of probe 
and polynucleotides. Separation and/or analysis of oligonucleotides is effected by 
electrophoresis, chromatography, or mass spectroscopy. In methods employing electrophoresis, 
the format may be thin flat chambers. In another embodiment, the separation and/or analysis is 
carried out by electrophoresis in capillary tubes. In another embodiment, the separation and/or 
analysis is carried out by molecular electrophoresis. The advantages of capillary electrophoresis 
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and/or molecular electrophoresis are efficient heat dissipation, which increases resolution and 
permits rapid separation under high electrical fields. Moreover, the small diameters and/or areas 
of the capillary tubes or other configurations used for molecular electrophoresis allow separation 
of numerous samples in arrays of capillaries. 

Sieving or nonsieving media are applicable to separation of probes including but not 
limited to the reaction products generated in the detection methods disclosed herein. Sieving 
media include covalently crosslinked matrices, such as polyacrylamide crosslinked with bis- 
acrylamide (see e.g. Cohen, A. S. et al. (1988) Rapid separation and purification of 
oligonucleotides by high performance capillary gel electrophoresis, Proc. Natl Acad. Sci USA 85: 
9660; Swerdlow, H. et al., (1990), Capillary gel electrophoresis for rapid, high resolution DNA 
sequencing, Nuc. Acids Res. 18 (6): 1415-1419) or linear polymers, for example 
hydroxypropylmethyicellulose, methyl cellulose, or hydroxylethylcellulose (Zhu et al. (1992), J. 
Chromatogr. 480: 311-319; Nathakarnkitkool, S., etal. (1992), Electrophoresis 13: 18-31). 

In one embodiment, the electrophoretic medium is a non-sieving medium. Although 
polynucleotides are not readily separable in a non-sieving medium, probes and polynucleotides 
have distinctive ratios of charge to translational frictional drag that permit separation in a non- 
sieving media, even when the probe and polynucleotides are of the same length. 

EXAMPLES 

Without further elaboration, it is believed that one skilled in the art can, using the 
preceding description, utilize the present invention to its fullest extent. The following specific 
examples are offered by way of illustration and not by way of limiting the remaining disclosure. 

EXAMPLE 1 

Sequence-Specific Double stranded Probe Synthesis using PCR 

This protocol describes the materials and methods used in the synthesis of a sequence-specific 
probe with a double stranded tail to which fluorescent tags were added. 

LNA-DNA Chimera Sense Strand Primers 

5' get cag gaa caa aga aac gcA GGG AGA GAG GAA GGA at tea cca gtc aca cga cca (SEQ 
ID NO:1) 

5' cag taa cag ata caa act caA GGG AGA GAG GAA GGA at tea cca gtc aca cga cca (SEQ 
ID NO:2) 

Blocking Oligo 

5' CTCCTTCCTCTCTCCCT (SEQ ID NO:3) 
DNA Antisense Primers 
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(Seq. Id. No:4 will generate a 5kb product and Seq. Id. No:5 will generate a 2.6kb product) 
5' attgatgccaccttttcagc (SEQ ID NO:4) 

5' ggtgctgctatcgatggttt(SEQ ID NO:5) 

UPPER CASE = LNA. lower case = DNA 

Annealing LNA-DNA chimera with Blocker LNA oligo 

The LNA-DNA chimera and LNA blocker oligo were mixed in 10 mM Tris-HCI (pH 7.4), 0.1 mM 
EDTA, overlaided with mineral oil, and annealed by heating the mixture to 105 ' C for 2 minutes 
and slowly cooling to 25 *C over 45 minutes in an MJ Research PTC-200 DNA Engine. 

A reaction mixture containing 0.1 DM annealed LNA-DNA chimera/Blocker hetero-dimer, 0.1 DM 
antisense primer, IX NovaTaq^"^ Buffer +MgCl2, 0.2mM dGTP, 0.2mM dCTP, 0.2 mM dATP, 
SODM dTTP, 0.3mM Amino-allyl-dUTP, 4% Dimethylsulfoxide, 35 pM double stranded 
M13mp18 RF I DNA, and 0.05 Units/DL NovaTaq^"^ DNA Polymerase was prepared to a final 
volume of 100 DL in a 0.5 mL tube adapted for a thermal cycling reaction. The reaction was 
thermocycled as follows: denaturation at 95 "C for 2 min, followed by repeated cycles of: 95 ''C 
for 15 sec and 65-66 'C for 4.5 min. for 30 cycles; 74 for 4 min, 4 ''C hold cycle. 

The reaction was purified over a Qiagen MinElute™ spin column using the protocol described in 
the kit handbook for the MinElute PGR Purification Kit (April 2001 ). The product was eluted in 
11 DL of nuclease-free water. Fluorescent dye was coupled to the probes (see Fig. 1). 

EXAMPLE 2 

Sequence specific Single-stranded Probe Synthesis Reaction 

In this example, 0.15 DM oligonucleotide 5' ggtcagtgccttgagtaacagt (SEQ ID NO:6), 
complementary to nucleotides 2090 to 21 12 of the phage strand of M13mp18, is used to prime 
a single strand DNA synthesis reaction using 0.2 mM unmodified dNTPs, IxNovaTaq Buffer, 
and NovaTaq polymerase (Novagen, San Diego, CA) off of 1 ng/Dl M13mp18 linearized at 
position 4131 with Pad. The reaction was cycled through 20-35 cycles of 94 ''C for 15 seconds, 
40-60°C for 30 seconds, and 68-72'*C for 60 seconds to yield a single strand DNA product of 
5230 bases in length (see Fig. 2). The approximately 5kb single strand DNA fragment was then 
fluorescently labeled dye. 

The dye labeled single strand probe was analyzed by single molecule detection, with having a 
range of intensities from 25-80 photons with a background of approximately 18 photons as 
shown in Fig. 3. 
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EXAMPLE 3 
Ligated Probes 

In this example, an oligonucleotide adaptor complex made up of a recognition oligonucleotide 
and a stitching oligonucleotide will be ligated onto a longer, fluorescently labeled, doubled strand 
DNA molecule. 

UPPER CASE = LNA, lower case = DNA 

The recognition oligo, 5' aGggAagAaaGcgAaaGgaggcTgccagcgacgag (SEQ ID NO:7), has a 20 
base 5' recognition sequence complimentary to position 5,574-5,594 on the phage strand (or + 
strand) of M13mp18. The 3' end is complementary to the stitching oligo, 3' cgacggtcgctgctctcga 
5' (SEQ ID NO:8). These two oligonucleotides were annealed by combining 5 DM of each 
olignucleotide in 10 mM Tris pH 8.0, 50 mM NaCI, 1 mM EDTA in a 1.5 ml tube. The tube was 
placed in a 95''C heat block that was cooled to room temperature over a one hour period. 

To make the double strand tail DNA molecule to be ligated to the oligo adaptor complex, 
pBR322 was treated with the restriction enzyme Dral to yield two fragments with blunt ends. 
The DNA was then treated with shrimp alkaline phosphatase to remove the 5' phosphate on the 
blunt ends. Removal of the phosphatase was carried out by heat denatuation at 65 ''C for 15 
minutes, followed by removal on a QIAquick Column (Qiagen, Valencia, CA). Restriction digest 
was then carried out using Hindi II to yield various products including a 447 bp fragment with one 
Hindlll 5' overhang end and one blunt, dephosphorolated end. 

This 447 bp fragment was gel purified and ligated to the oligonucleotide adapter duplex using 
the LigaFast Rapid DNA Ligation System (Promega, Madison, Wl). As shown in Fig. 4, the 
ligation reaction yields three products. The oligo adapter complex/447bp fragment heterodimer 
(466bp+20ss bases) is shown as the predominant band in the ligation containing the oligo 
adapter complex, the 447bp fragment, and ligase (Fig 4. Lane 1). The 447bp fragment 
homodimer (898bp) is visible as a faint secondary product in this ligation (Fig 4. Lane 1) and as 
the predominant species in the control ligation that lacks the oligo adapter complex (Fig 4. Lane 
4). The oligonucleotide adapter complex also ligates to form a homodimer (34bp + 40ss bases) 
(data not shown). 

The desired heterodimer product was gel purified after running the ligation reaction on a 5% 
agarose. IxTAE gel and was fluorescently labeled. This final product has a 20 base single 
strand of LNA-DNA nucleotides at the 5' end that serves as a sequence specific probe for 
M13mp18, and has a -460 base dye labeled double strand portion that serves as the 
fluorescent tail. 
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The ligated 447 bp DNA fragment was then analyzed by single molecule detection. The 
molecules had a range of intensities from 15-40 photons with a background of approximately 10 
photons as shown in Fig. 5. The buffer only control has no events over 10 photons. 

EXAMPLE 4 

In this example, DNA of two different lengths were labeled with fluorescent dye and analyzed on 
the single molecule detection instrument to demonstrate the concept that varying tail length 
provides a method for controlling the fluorescence intensity of probes made using methods 
described in EXAMPLES 1. 2. and 3. 

The following oligos were used to synthesize PGR products of two different sizes from 
M13mp18DNA. 

Sense Strand Primer 

5* attcaccagtcacacgacca (SEQ. ID. NO:9) 

DNA Antisense Primers 

5 'aagccggaggttaaaaaggt (SEQ. ID. NO: 10) 1 kb 
5' ggtgctgctatcgatggttt (SEQ. ID. NO:11) 2.6kb 

A Ikb fragment was generated using SEQ. ID. NO:9 and SEQ. ID. NO: 10 as primer pair. A 
2.6kb fragment was generated using SEQ. ID. NO:9 and SEQ. ID. NO:1 1 as a primer pair. 
Reaction mixtures were made up of 0.1 DM each primer, IxNovaTaq Buffer, 0.2mM dGTP, 
0.2mM dCTP, 0.2mM dATP, 50 DM dTTP, 0.3mM amino-allyl-dUTP, 4% Dimethylsulfoxide, 
35pM M13mp18 RFI DNA, and 0.05 Units/DL of NovaTaq DNA Polymerase. The reactions 
were cycled through 30 cycles of 94^0 for 15 seconds, 40-60°C for 30 seconds, and 68-72''C for 
60 seconds. The resulting PGR products were purified away from unincorporated amino-allyl- 
dUTP on QIAquick columns (Qiagen, Valencia. CA). Fluorescent dye was coupled to the 
probes. 

Each of the two samples were diluted into buffer and run on the single molecule instrument. 
Plots of photon intensities verses time are shown in Fig. 6. The buffer only plot (Fig. 6A) shows 
the system background to be approximately 5 photons. The Ikb DNA tail (Fig. 6B) had events 
ranging from 6 to 15 photons (up to - 3 times the system background). The 2.6kb DNA tail (Fig. 
6C) had events ranging from 6 to 40 photons (up to --8 times the system background). These 
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data show that the photon intensities of the molecules increase with the length of the DNA tail 
molecules. 

Other Embodiments 

The detailed description set-forth above is provided to aid those skilled in the art in 
practicing the present invention. However, the invention described and claimed herein is not to 
be limited in scope by the specific embodiments herein disclosed because these embodiments 
are intended as illustration of several aspects of the invention. Any equivalent embodiments are 
intended to be within the scope of this invention. Indeed, various modifications of the invention 
in addition to those shown and described herein will become apparent to those skilled in the art 
from the foregoing description which do not depart from the spirit or scope of the present 
inventive discovery. Such modifications are also intended to fall within the scope of the 
appended claims. 
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